Category: AG Quiz


  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which format defines infrastructure using YAML? A) ARM  B) CloudFormation  C) Terraform  D) All Correct Answer: D Explanation ARM, CloudFormation, and Terraform are tools for defining infrastructure as code. ARM (Azure Resource Manager) and CloudFormation (AWS) support JSON and YAML for declarative resource definitions. Terraform uses HCL…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Terraform is a tool for: A) Monitoring  B) Orchestration  C) Infrastructure as Code  D) Backup Correct Answer: C Explanation Terraform is an Infrastructure as Code (IaC) tool that defines, provisions, and manages cloud infrastructure using declarative configuration. It supports multiple cloud providers, enabling a unified way to manage…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which service is a “serverless” type? A) EC2  B) Lambda  C) EMR  D) Databricks Correct Answer: B Explanation AWS Lambda is a serverless compute service that runs code without server management. You pay only for execution time, and Lambda automatically scales based on request volume. It is…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Azure Data Factory is: A) Storage  B) ETL orchestrator  C) Visualization tool  D) Scheduler Correct Answer: B Explanation Azure Data Factory (ADF) is a cloud ETL and ELT service for orchestrating data flows. It allows extracting data from various sources, transforming it, and loading it into target…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: In AWS, data flows are often processed using: A) Lambda  B) S3  C) Glue  D) All of the above Correct Answer: D Explanation In AWS, data flows are processed using a combination of several services. S3 serves as a data storage layer, Lambda enables serverless and event-driven processing, and Glue provides ETL…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: “Which cloud provides BigQuery?: A) AWS B) Azure C) GCP D) Oracle Correct Answer: C Explanation BigQuery is a fully managed serverless data warehouse service provided by Google Cloud Platform (GCP). It enables large-scale data analysis using SQL…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: “Data Lakehouse” combines features of: A) Lake + Warehouse  B) OLTP + OLAP  C) Kafka + Spark  D) Hadoop + Hive Correct Answer: A Explanation A Data Lakehouse is a modern data architecture that integrates the strengths of a Data Lake and a Data Warehouse. From the Data Lake it inherits the ability to…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Delta Lake adds to Parquet files: A) Schema  B) Transactions  C) Metadata  D) All of the above Correct Answer: D Explanation Delta Lake is an open-source storage layer that turns plain Parquet files into a robust transactional data lake with ACID properties. Delta Lake adds four key layers on top…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which Airflow component defines dependencies? A) Task  B) DAG  C) Operator  D) Hook Correct Answer: B Explanation A DAG (Directed Acyclic Graph) is the core building block of Apache Airflow that defines the entire workflow including all tasks and their dependencies. A DAG is an acyclic…

  • This is a regular “data quiz”. Follow it on LinkedIn. Test your knowledge or learn something new. Today Question: Which of these systems is a batch engine? A) Apache Spark  B) Kafka  C) Flink  D) RabbitMQ Correct Answer: A Explanation Apache Spark is primarily a batch processing engine that has become the de facto standard for large-scale data processing in distributed environments. Unlike streaming systems, Spark processes data in…