Data Engineering & Pipelines
Spark / PySpark
Apache Airflow
AWS Glue
dbt
ETL/ELT
Kafka
Cloud & Infrastructure
AWS (EMR, EC2, S3, Athena, EKS)
Kubernetes
Docker
Helm
ArgoCD
GitOps
GitLab CI/CD
Observability & Monitoring
Prometheus
PromQL
Thanos
Grafana
OpsGenie
Databases & Storage
SQL
PostgreSQL
Snowflake
MongoDB
BigQuery
ML & AI
scikit-learn
Clustering
Classification
Regression
TensorFlow / Keras
Spark MLlib
Time-series forecasting
BI & Visualization
Power BI
Tableau
Programming Languages
Python
Golang
SQL
Java