Which of the following are CRUD operations available in HBase? (Choose two.)
A. HTable.Put
B. HTable.Read
C. HTable.Delete
D. HTable.Update
E. HTable.Remove
When creating a new table in Hive, which HiveQL clause specifies the storage file format?
A. SAVE AS
B. MAKE AS
C. FORMAT AS
D. STORED AS
PCI compliance requirements allow the use of real customer data during testing and development only when:
A. The data is only processed in memory
B. Customer data is never allowed in testing or development
C. The data is never stored longer than 24 hours in the test system
D. The data is only stored in volatile storage that expires on power loss
What does the acronym "PCI" stand for in the phrase "PCI compliant"?
A. Payment Card Industry
B. Personal Credit and Income
C. Premium Credit Inspection
D. Proactive Controls Implementation
For a customer satisfaction application, a large eCommerce company loads clickstream data into MySQL and runs Cognos reports that are accessed by more than 50 customer service reps (CSRs). The IT team has decided to move the application to the new BigInsights-based Hadoop cluster, load the data into a Hive warehouse, and run the queries there. Which one of the following would enable them to reuse the SQL queries without extensive rewriting?
A. Pig
B. NoSQL
C. HiveQL
D. Big SQL
Which command can be used to move data from HDFS into an existing table in a relational database?
A. sqoop
B. hadoop fs -put
C. hadoop fs -copyToLocal
D. hadoop fs -copyFromLocal
A large telecom company wants to store data from multiple databases in Hadoop. They plan to bulk load the data into Hadoop and run analytical queries. Which data store would be ideal for this scenario?
A. Hive
B. HBase
C. BigSheets
D. Apache Spark
For which level of Apache Spark data locality is the data closest to the code that processes it?
A. ANY
B. NO_PREF
C. RACK_LOCAL
D. NODE_LOCAL
E. PROCESS_LOCAL
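As background for the question above, the Spark tuning guide lists its locality levels from closest to farthest as PROCESS_LOCAL, NODE_LOCAL, NO_PREF, RACK_LOCAL, ANY. The following is a minimal Python sketch of that ordering only; the `Locality` class and the `closest` helper are hypothetical names used for illustration and are not part of any Spark API.

```python
from enum import IntEnum


class Locality(IntEnum):
    """Hypothetical model of Spark's locality levels; lower value = closer to the code."""
    PROCESS_LOCAL = 0  # data is in the same JVM as the running code
    NODE_LOCAL = 1     # data is on the same node
    NO_PREF = 2        # data has no locality preference
    RACK_LOCAL = 3     # data is on the same rack of servers
    ANY = 4            # data is elsewhere on the network


def closest(levels):
    """Return the level whose data sits nearest to the processing code."""
    return min(levels)


if __name__ == "__main__":
    available = [Locality.ANY, Locality.RACK_LOCAL, Locality.NODE_LOCAL]
    print(closest(available).name)
```

The sketch only ranks levels; it does not model how Spark waits for a closer slot before falling back to a farther one.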
In which of the following languages is the Flume open-source software written?
A. C / C++
B. Primarily Java
C. Python and related tools
D. A combination of C++ and Perl
Which one of the following file formats is optimal for querying tables with many columns and performing aggregation operations such as SUM() and AVG()?
A. Text
B. Avro
C. JSON
D. Parquet
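As background for the question above, Parquet is a column-oriented format, whereas text and JSON files store whole records one after another. The following is a rough Python sketch of why layout matters for aggregations such as SUM() and AVG(). It uses plain in-memory lists and dictionaries rather than any real file format, and the sample table and all function names are hypothetical.

```python
# Hypothetical sample table: several columns, but only `price` is aggregated.
ROWS = [
    {"id": 1, "name": "a", "city": "x", "price": 10.0},
    {"id": 2, "name": "b", "city": "y", "price": 20.0},
    {"id": 3, "name": "c", "city": "z", "price": 30.0},
]


def to_columns(rows):
    """Pivot row dicts into one list per column (a column-oriented layout)."""
    columns = {}
    for row in rows:
        for key, value in row.items():
            columns.setdefault(key, []).append(value)
    return columns


def sum_row_layout(rows, column):
    """Row layout: every full record is visited to reach a single field."""
    return sum(row[column] for row in rows)


def sum_column_layout(columns, column):
    """Column layout: only the values of the requested column are read."""
    return sum(columns[column])


def avg_column_layout(columns, column):
    """Average over one column without touching the others."""
    values = columns[column]
    return sum(values) / len(values)


if __name__ == "__main__":
    cols = to_columns(ROWS)
    print(sum_row_layout(ROWS, "price"))
    print(sum_column_layout(cols, "price"))
    print(avg_column_layout(cols, "price"))
```

Both layouts give the same totals; the difference is how much unrelated data must be scanned, which grows with the number of columns in the row-oriented case.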