Computer Science > QUESTIONS & ANSWERS > HaDoop Test Questions & Answers (All)

HaDoop Test Questions & Answers

Document Content and Description Below

HaDoop Test Questions & Answers What project was Hadoop originally a part of and what idea was that project based on? - >>>>Nutch. It was based on returning web search results faster by distributing data and calculations across different compters. Fill in the blank. The solution to cataloging the increasing number of web pages in the late 1900's and early 2000's was _______. - >>>>Automation T/F: The Cassandra File System has many advantages over HDFS, but simpler deployment is not one of them. - >>>>False Name three features of Hive. - >>>>HiveQL, Indexing, Different Storage types Describe how Sqoop transfers data from a relational database to Hadoop. - >>>>Runs a query on a relational database and exports into files in a variety of formats. They are then saved on HDFS. Lists three drawbacks of using Hadoop. - >>>>Does not work well with small amounts of data, MapReduce is difficult to implement or understand, does not guarantee atomicity transactions Explain why the performance of join operation in Hadoop is inefficient. - >>>>HaDoop does not have indicies so the entire dataset is copied in the join operation. T/F: Hadoop is not recommended to company with small amount of data but it is highly recommended if this data requires instance analysis. - >>>>False FIll in the blank. Hadoop lacks notion of ________ and _______. Therefore, the analyzed result generated by Hadoop may or may not be 100% accurate. - >>>>Transaction Consistency, Recovery Checkpoint T/F: The main benefit of HadoopDB is that it is more scalable than Hadoop while maintaining the same performance level on structured data analysis workloads. - >>>>False Which of the following is NOT Hadoop drawbacks? A) inefficient join operation B) security issue C) does not optimize query for user D) high cost E) MapReduce is difficult to implement - >>>>D The command for removing a file from hadoop recursively is: hadoop dfs ___________ - >>>>-rmr The __________ holds the data in the HDFS and the application connects with the __________ to send and retrieve data from the cluster. - >>>>Datanode, Namenode T/F: Hadoop is open source. - >>>>True Which of the following is NOT true: A) Hadoop is decentralized B) Hadoop is distributed. C) Hadoop is open source. D) Hadoop is highly scalable. - >>>>

[Show More]

Last updated: 3 years ago

Preview 1 out of 4 pages

Buy Now

Instant download

We Accept: