Which of the following statements regarding Big SQL is TRUE?
A. Big SQL doesn't support stored procedures
B. Big SQL can be deployed on a subset of data nodes in the BigInsights cluster
C. Big SQL provides a SQL-on-Hadoop environment based on map reduce
D. Only tables created or loaded via Big SQL can be accessed via Big SQL
In order for an SPSS Modeler stream to be incorporated for use in an InfoSphere Streams application leveraging SPSS Modeler Solution Publisher, you need to:
A. add a Type node
B. insert any Output node
C. add a Table node as the terminal node
D. Make the terminal node a scoring branch
When we create a new table in Hive, which clause can be used in HiveSQL to indicate the storage file format?
A. SAVE AS
B. MAKE AS
C. FORMAT AS
D. STORED AS
Which one of the following is NOT provided by the SerDe interface?
A. SerDe interface has to be built using C or C++ language
B. Allows SQL-style queries across data that is often not appropriate for a relational database
C. Serializer takes a Java object that Big SQL has been working with, and turns it into a format that BigSQL can write to HDFS
D. Deserializer interface takes a string or binary representation of a record, and translates it into a Java
object that Big SQL can manipulate
Which of the following Big SQL statements is valid?
A. CREATE TABLE t1 WITH CS;
B. WITH t1 AS (...) (SELECT * FROM t1 WITH RR USE AND KEEP SHARE LOCKS) UNION ALL (SELECT * FROM t1 WITH UR);
C. SELECT deptno, deptname, mgrno FROM t1 WHERE admrdept =`A00' FOR READ ONLY WITH RS USE AND KEEP EXCLUSIVE LOCKS
D. ALTER TABLE t1 ALTER COLUMN deptname SET DATA TYPE VARCHAR(100) USE AND KEEP UPDATE LOCKS
What is a method for loading RDBMS data into an HBase table?
A. HDFS LOAD
B. SQOOP IMPORT
C. LOAD HADOOP USING
D. Hadoop jar hbase-VERSION.jar importtsv
Which of the following statement is TRUE with BigSheets?
A. You can create any type of sheet from a parent workbook
B. You must create a child workbook in order to create a chart
C. You can delete a parent workbook without deleting the child workbooks
D. You must run the workbook on the data to get the full results of the analysis
Which of the following statements regarding importing streaming data from InfoSphere Streams into Hadoop is TRUE?
A. InfoSphere Streams can only write to HDFS not read from HDFS
B. InfoSphere Streams can only write directly to BigInsights, not other Hadoop distributions like Hortonworks or Cloudera
C. A Streams developer needs to account for the fact that BigInsights may not be able to absorb the incoming streams at the rate InfoSphere Streams is sending them
D. Adding a Big Data toolkit operator (for writing to Hadoop) to an InfoSphere Streams Processing Language (SPL) application requires that the SPL application be recompiled
Which of the following statements regarding importing streaming data from InfoSphere Streams into Hadoop is TRUE?
A. InfoSphere Streams utilizes Flume to interface to Hadoop
B. The HDFSFileSink operator writes files in parallel to a Hadoop Distributed File System
C. Buffering techniques are used to process incoming streams from InfoSphere Streams
D. When you use the HDFS operators to access GPFS, you must install InfoSphere Streams on an InfoSphere Big Insights data node
Which of the following is not a data-processing operations that is supported in Pig Latin?
A. filter
B. joins
C. group by
D. logistic regression