Which of the following options best describes the differences between a traditional data warehouse environment and a Hadoop environment?
A. Traditional data warehousing environments are mostly ideal for analyzing structured data from various systems, while a Hadoop environment is well suited to deal with structured, semi-structured, and unstructured data, as well as when a data discovery process is needed.
B. Hadoop environments are mostly ideal for analyzing structured and semi-structured data from a single system, while a traditional data warehousing environment is well suited to deal with unstructured data, as well as when a data discovery process is needed.
C. Typically, data stored in Hadoop environments is cleaned up before being stored in the distributed file system.
D. Typically, data stored in data warehousing environments is rarely filtered and pre-processed. On the other hand, data injected into Hadoop environments is always pre-processed and filtered.
Which of the following options DOES NOT describe an advantage of InfoSphere BigInsights on top of regular standalone Hadoop distributions?
A. Integrated install
B. BigSheets
C. Adaptive MapReduce
D. Support for the HiveQL and Pig Latin languages.
How do big data solutions interact with the existing enterprise infrastructure?
A. Big data solutions must substitute the existing enterprise infrastructure; therefore there is no interaction between them.
B. Big data solutions are only plug-ins and additions to existing data warehouses, and therefore cannot work with any other enterprise infrastructure.
C. Big data solutions must be isolated into a separate virtualized environment optimized for sequential workloads, so that they do not interact with existing infrastructure.
D. Big data solutions work in parallel and together with the existing enterprise infrastructure, where pre-existing connectors are used to integrate big data technologies with other enterprise solutions.
Which of the following methods can be used by Cognos to connect with InfoSphere BigInsights? (Choose two)
A. Connection via the Hive JDBC driver.
B. Connection via the SPL JDBC driver.
C. Connection via the Pig JDBC driver.
D. Connection via the Big SQL JDBC driver.
What is the difference between Hadoop MapReduce and the IBM Adaptive MapReduce feature available in InfoSphere BigInsights?
A. Hadoop MapReduce is optimized for operating on small files or splits, while IBM Adaptive MapReduce is optimized for operating on large partitioned files.
B. Hadoop MapReduce is optimized for operating on large files, while IBM Adaptive MapReduce can be configured to operate optimally on large or small files or splits.
C. Hadoop MapReduce is optimized for operating on small files or splits, while IBM Adaptive MapReduce is optimized for operating on large files stored in individual blocks.
D. Hadoop MapReduce is optimized for operating on small partitioned tables stored in the HBase component, while IBM Adaptive MapReduce is optimized for operating on large partitioned files.
InfoSphere BigInsights offers the following benefits to your organization, EXCEPT:
A. It cuts costs by providing an efficient compression algorithm based on row-level compression, using a dictionary to store repetitive patterns.
B. It complements your existing infrastructure by extending your data collection and analysis capabilities.
C. It integrates data from a variety of structured and unstructured sources.
D. It enables analysis at scale.
Which of the following options is NOT CORRECT?
A. InfoSphere BigInsights offers great value to organizations that are dealing with Internet-scale volumes (petabytes) of data that exist in many different formats.
B. InfoSphere BigInsights facilitates the installation, integration, and monitoring of Hadoop systems.
C. InfoSphere BigInsights is offered in different editions: Basic, Limited Developer, and Enterprise.
D. InfoSphere BigInsights helps organizations quickly build and deploy custom analytics and workloads to capture insight from big data that can then be integrated into existing database, data warehouse, and business intelligence infrastructures.
The analytic capabilities that come built-in with the Big Data platform include:
A. Image and Video
B. Acoustic
C. Financial
D. All of the above
What is a major difference between HDFS and GPFS-FPO?
A. There is no difference; GPFS-FPO is just the IBM version of HDFS in BigInsights.
B. There is no difference, as GPFS-FPO is the new open source standard version for the Hadoop file system.
C. Unlike HDFS, GPFS-FPO is a kernel-level, POSIX-compliant, and highly available file system that allows standard applications to also use the storage area marked for Hadoop use.
D. Unlike HDFS, GPFS-FPO is a kernel-level, POSIX-compliant file system that allows standard applications to also use the storage area marked for Hadoop use. Like HDFS, GPFS-FPO also has an issue with the metadata NameNode service, which may be a single point of failure.
Which of the following compression algorithms is used by InfoSphere BigInsights to provide an additional compression option over the ones that come with the base Hadoop distribution?
A. gzip.
B. bzip2.
C. lza.
D. lzo.