BigData & Hadoop Welcome to your BigData and Hadoop All of the following accurately describe Hadoop, EXCEPT- Open source Real-time Java-based Distributed computing approach Hint Which tool could be used to move data from RDBMS data to HDFS? Flume Sqoop Both the above Hive Hint Which of these provides a Stream processing system used in Hadoop ecosystem? Hive Solr Tez Spark Hint During Safemode Hadoop cluster is in- Read-only Write-only Read-Write None of the above Hint Which among the following are the features of Hadoop open source fault tolerant high availability ALL of the above Hint The need for data replication can arise in various scenarios like- Replication Factor is changed DataNode goes down Data Blocks get corrupted ALL of the above For the frequently accessed HDFS files the blocks are cached in- the memory of the datanode in the memory of the namenode Both the above None of the above Clients connect to ________ for I/O NameNode DataNode Secondary NameNode None of the above The HDFS command to create the copy of a file from a local system is which of the following? copyFromLocal CopyFromLocal CopyLocal copyfromlocal HDFS provides a command line interface called __________ used to interact with HDFS HDFS Shell FS Shell DFS Shell None of the mentioned Which of the follwing is the faeture of PIG? Rich Set of Operators Extensibility Optimization opportunities All of the above In which all languages you can code in Hadoop ? Python Java C++ All of the above Which of the following phases occur simultaneously ? Shuffle and Sort Reduce and Sort Shuffle and Map All of the mentioned Which of the following is used to ingest streaming data into Hadoop clusters? Flume Sqoop Both of the above None of the above Which of the following is true about MapReduce? Data processing layer of hadoop It provides the resource management It is an open source data warehouse system for querying and analyzing large datasets stored in hadoop files All of the above Which of the following is a data processing engine for clustered computing? Drill Spark Oozie ALL of the above Which scenario demands highest bandwidth for data transfer between nodes Different nodes on the same rack Nodes on different racks in the same data center Nodes in different data centers Data on the same node. Time's up neXt Era Technologies