HaDoop Test Questions & Answers
What project was Hadoop originally a part of and what idea was that project
based on? - >>>>Nutch. It was based on returning web search results faster by distributing data
and calculati
...
HaDoop Test Questions & Answers
What project was Hadoop originally a part of and what idea was that project
based on? - >>>>Nutch. It was based on returning web search results faster by distributing data
and calculations across different compters.
Fill in the blank. The solution to cataloging the increasing number of web pages
in the late 1900's and early 2000's was _______. - >>>>Automation
T/F: The Cassandra File System has many advantages over HDFS,
but simpler deployment is not one of them. - >>>>False
Name three features of Hive. - >>>>HiveQL, Indexing, Different Storage types
Describe how Sqoop transfers data from a relational database to Hadoop. - >>>>Runs a query on
a relational database and exports into files in a variety of formats. They are then saved on HDFS.
Lists three drawbacks of using Hadoop. - >>>>Does not work well with small amounts of data,
MapReduce is difficult to implement or understand, does not guarantee atomicity transactions
Explain why the performance of join operation in Hadoop is inefficient. - >>>>HaDoop does not
have indicies so the entire dataset is copied in the join operation.
T/F: Hadoop is not recommended to company with small amount of
data but it is highly recommended if this data requires instance analysis. - >>>>False
FIll in the blank. Hadoop lacks notion of ________ and _______.
Therefore, the analyzed result generated by Hadoop may or may not be 100%
accurate. - >>>>Transaction Consistency, Recovery Checkpoint
T/F: The main benefit of HadoopDB is that it is more scalable than
Hadoop while maintaining the same performance level on structured data
analysis workloads. - >>>>False
Which of the following is NOT Hadoop drawbacks?
A) inefficient join operation
B) security issue
C) does not optimize query for user
D) high cost
E) MapReduce is difficult to implement - >>>>D
The command for removing a file from hadoop recursively is:
hadoop dfs ___________ - >>>>-rmr
The __________ holds the data in the HDFS and the application
connects with the __________ to send and retrieve data from the cluster. - >>>>Datanode,
Namenode
T/F: Hadoop is open source. - >>>>True
Which of the following is NOT true:
A) Hadoop is decentralized
B) Hadoop is distributed.
C) Hadoop is open source.
D) Hadoop is highly scalable. - >>>>
[Show More]