Big Data Engineer
Description:
Expert in developing complex data pipelines, optimizing, and managing large volumes of batch and streaming data in distributed environments.
Key skills:
-
Experience with orchestration tools like Apache Airflow, Google Cloud Composer, Google Dataform
-
Streaming with Apache Kafka, Pub/Sub
-
ETL/ELT on BigQuery, Cloud Dataflow, Dataproc
-
Preferred: Knowledge of Google Datastream, Data Fusion
-
Relational databases: MariaDB, MySQL, SQL Server
-
Languages: Python, SQL (Java is a plus for complex streaming flows)
-
Data modeling and datamart creation