Q1. What is the term for the process of analyzing large and complex datasets to uncover patterns, trends, and insights?
Data Analysis
Data Warehousing
Data Visualization
Data Mining
Q2. Big Data can be found in how many versions?
1
4
2
3
Q3. How many V's are there in Big Data?
3
2
5
4
Q4. Which of these options is Hadoop named after?
Creator Doug Cutting's favourite circus act
The toy elephant of Creator Cutting's son
Cutting's high school best friend
A sound Cutting's laptop made during Hadoop development
Q5. Which of the following is not a data type commonly encountered in Big Data?
XML
Binary
JSON
CSV
Q6. What can be described as a model for programming used to develop applications based on Hadoop that can process massive amounts of data?
Mahout
None of the above
Oozie
MapReduce
Q7. Which type of data refers to data that is generated in real-time or near real-time?
Structured Data
Semi-Structured Data
Streaming Data
Unstructured Data
Q8. Which technology is commonly used for distributed data storage in Big Data systems?
MongoDB
Cassandra
SQL
HDFS
Q9. Which of these has the world's largest Hadoop cluster?
Facebook
All of the above
Datamatics
Apple
Q10. Which of the following is not a characteristic of Big Data?
Variety
Velocity
Volume
Velocity
Q11. All the options given accurately describe Hadoop except one. Which one is it?
Open-source
Real-time
Distributed computing approach
Java-based
Q12. Which technology is commonly used for real-time stream processing in Big Data systems?
Spark
Hadoop
Kafka
Flink
Q13. Which technology is commonly used for real-time data analytics and visualization?
Power BI
Databricks
QlikView
Tableau
Q14. Hadoop is a framework. It is used with several types of related tools. What are its common cohorts?
MapReduce, Hummer, and Iguana
MapReduce, MySQL, and Google Apps
MapReduce, Hive, and HBase
MapReduce, Heron, an Trumpet
Q15. Which of the following is not a challenge associated with Big Data?
Scalability
Privacy
Data Consistency
Security
Q16. Which technology framework is commonly used for distributed storage and processing of Big Data?
Kafka
Spark
Hadoop
Flink
Q17. Which of these projects based on Hadoop is used by Facebook to tackle with Big Data?
Prism
Project Prism
Project Data
Project Big
Q18. What is the term for the process of integrating data from multiple sources to create a unified view?
Data Aggregation
Data Normalization
Data Integration
Data Fusion
Q19. Which of the following is not a key feature of Apache Spark?
MapReduce Support
Real-time Processing
In-memory Computing
Batch Processing
Q20. What is the term for a collection of data that is too large to be processed using traditional database techniques?
Data Lake
Data Stream
Data Pond
Data Reservoir
Q21. Which of the following is not a characteristic of a data warehouse?
Optimized for analytics
Historical data
Integrated data
Real-time processing
Q22. Which technology is commonly used for distributed messaging in Big Data systems?
Kafka
Hadoop
Flink
Spark
Q23. Which type of database is optimized for handling transactional workloads and providing high availability?
NoSQL
OLTP
OLAP
NewSQL
Q24. Data is what size of bytes is known as Big Data?
Peta
Meta
Tera
Giga
Q25. What is the term for a large volume of data that cannot be processed using traditional database techniques?
Mega Data
Massive Data
Huge Data
Big Data
Q26. Which of the following is not a layer of the Big Data stack?
Application Layer
Presentation Layer
Storage Layer
Processing Layer
Q27. What is the term for the process of storing data across multiple servers to ensure redundancy and fault tolerance?
Data Replication
Data Partitioning
Data Sharding
Data Redundancy
Q28. What is the transaction data of the bank?
Unstructured data
None of the above
Structured data
Both 1 and 2
Q29. Which of the following is not a component of the Hadoop ecosystem?
HDFS
YARN
MapReduce
Spark
Q30. What is the term for the process of cleaning and transforming raw data into a usable format for analysis?
Data Scrubbing
Data Staging
Data Cleansing
Data Preparation