Dice is the leading career destination for tech experts at every stage of their careers. Our client, Apetan Consulting, is seeking the following. Apply via Dice today!<br><br><strong>Key Responsibilities<br><br></strong><ul><li>Design, build, and maintain ETL/ELT data pipelines</li><li>Develop Python-based data processing applications</li><li>Work with structured and unstructured data at scale</li><li>Integrate data from multiple sources (APIs, databases, files, streams)</li><li>Optimize data workflows for performance and reliability</li><li>Ensure data quality, validation, and monitoring</li><li>Collaborate with data scientists, analysts, and backend teams</li><li>Manage and maintain data warehouses/lakes</li><li>Implement logging, error handling, and automation</li><li>Follow best practices for security and compliance<br><br></li></ul><strong>Required Skills<br><br></strong>Programming<br><br><ul><li>Strong Python (Pandas, NumPy, PySpark)</li><li>Writing clean, modular, and testable code<br><br></li></ul>Databases & Storage<br><br><ul><li>SQL (PostgreSQL, MySQL, SQL Server)</li><li>NoSQL (MongoDB, Cassandra optional)</li><li>Data Warehouses (Snowflake, Redshift, BigQuery)<br><br></li></ul>Big Data & Processing<br><br><ul><li>Apache Spark, Hadoop (preferred)</li><li>Batch and streaming data processing<br><br></li></ul>Cloud Platforms <br><br><ul><li>AWS / Azure / Google Cloud Platform</li><ul><li>S3, Lambda, Glue, Dataflow, BigQuery, etc.<br></li></ul></ul>Data Engineering Tools<br><br><ul><li>Airflow, Prefect, Luigi (orchestration)</li><li>Kafka / PubSub (streaming optional)</li><li>DBT (data transformation)<br><br></li></ul>DevOps & Other<br><br><ul><li>Git, CI/CD</li><li>Docker, Kubernetes (nice to have)</li><li>Linux basics</li></ul>