DP_PORTFOLIO DATA_ENGINEER NYC · USA v1.0 · 2025
Data Engineer

Dhruvil Panchal.

ETL/ELT Specialist · Analytics Engineer · Cloud Data Architect
5+
Years Exp.
1M+
Records Processed
40%
Deploy Efficiency ↑

Data Engineer with expertise in building scalable data pipelines, optimizing ETL/ELT processes, and implementing cloud-based data solutions.

Passionate about transforming raw data into valuable insights through robust engineering. Experienced across the modern data stack: ingestion with Airbyte and Fivetran, transformation with dbt, orchestration in Airflow, and analytics-ready warehouses in Snowflake and BigQuery.

Holds a Master's in Information Systems and brings hands-on experience from financial services, enterprise RBAC, and cloud-native pipeline design.

Scalable data pipeline design
ETL/ELT performance optimization
Cloud data warehouses & lakes
Real-time data processing
Data quality & monitoring
Workflow automation & CI/CD
Location: New York, NY · Open to relocate
Phone: +1-407-494-1791
Email: dpanchal.dp.2005@gmail.com
Degree: M.S. Information Systems
Job Type: Full-time · Contract
Freelance: Available

A full end-to-end data stack covering every layer of the modern data engineering lifecycle — from raw source ingestion to analytics-ready consumption.

Ingestion
Apache Kafka · AWS Glue · Airbyte · Fivetran · REST APIs · S3 / HDFS
Storage
Snowflake · AWS S3 · PostgreSQL · Delta Lake · AWS Lake Formation · Athena
Transform
dbt Core · Apache Spark · PySpark · Pandas · Talend · SSIS
Orchestrate
Apache Airflow · GitHub Actions · Docker · AWS Lambda · CI/CD
Visualize
Power BI · Tableau · Flask APIs · Scikit-learn
Cloud
AWS · Azure · GCP · Snowflake Cloud
Languages
Python · SQL · YAML · Bash · PySpark
01
Modern ELT Pipeline with dbt
Production-grade ELT pipeline using dbt Core, Snowflake, and Airflow with automated data quality tests and CI/CD deployment via GitHub Actions.
dbt · Snowflake · Airflow
02
End-to-End ML Pipeline
Regression and classification models with feature engineering pipelines and production Flask API deployment. Full model lifecycle management.
Scikit-learn · Flask · Pandas
03
ETL Pipeline for Financial Data
Scalable ETL pipeline processing 100K+ financial transactions daily using Python and AWS Glue with robust error handling and monitoring.
Python · AWS Glue
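The error handling described for this pipeline can be illustrated with a minimal sketch. It is not the project's actual code: the `Transaction` type, the `txn_id` and `amount` field names, and the `run_batch` helper are all hypothetical, and AWS Glue is left out so the example runs with the standard library alone. The idea shown is that an invalid record is routed to a reject list with its error message instead of aborting the whole batch.

```python
from dataclasses import dataclass
from decimal import Decimal, InvalidOperation


@dataclass
class Transaction:
    # Hypothetical shape of one cleaned financial record.
    txn_id: str
    amount: Decimal


def parse_record(raw: dict) -> Transaction:
    """Validate one raw record; raise ValueError on bad input."""
    txn_id = str(raw.get("txn_id") or "").strip()
    if not txn_id:
        raise ValueError("missing txn_id")
    try:
        amount = Decimal(str(raw.get("amount")))
    except InvalidOperation as exc:
        raise ValueError(f"bad amount: {raw.get('amount')!r}") from exc
    if not amount.is_finite():
        raise ValueError("amount is not a finite number")
    return Transaction(txn_id, amount)


def run_batch(raw_records):
    """Return (clean, rejected); one bad record never stops the batch."""
    clean, rejected = [], []
    for raw in raw_records:
        try:
            clean.append(parse_record(raw))
        except ValueError as exc:
            # Keep the original record next to the reason it failed,
            # so it can be inspected or replayed later.
            rejected.append({"record": raw, "error": str(exc)})
    return clean, rejected


clean, rejected = run_batch([
    {"txn_id": "t1", "amount": "10.50"},
    {"txn_id": "", "amount": "5"},
    {"txn_id": "t3", "amount": "abc"},
])
```

The size of the reject list per batch is also a natural metric to feed into monitoring.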
04
Real-time Analytics Dashboard
Kafka streaming pipeline feeding live Power BI dashboards for US consumer complaint analysis with drill-through and trend detection.
Kafka · Power BI
05
Cloud Data Warehouse
Snowflake-based cloud data warehouse for Adidas US sales data with dbt transformations, star-schema modeling, and Tableau reporting layer.
Snowflake · dbt
06
Geospatial Job Market Analysis
Geospatial data processing system for NYC job market analytics using Python and GeoPandas with automated Airflow DAGs and PostgreSQL.
GeoPandas · Airflow · PostgreSQL

Data Engineering

ETL/ELT Pipelines
90%
Data Warehousing
85%
Data Modeling
80%

Cloud

AWS
85%
Azure
75%
GCP
70%

Programming

Python
90%
SQL
85%
PySpark
75%

Tools & Frameworks

Airflow
80%
Snowflake
75%
dbt
70%
Snowflake
Hands-On Essentials: Data Warehousing Workshop
ELT pipelines, clustering, and query performance optimization in Snowflake.
Snowflake
Hands-On Essentials: Collaboration, Marketplace & Cost Estimation Workshop
Data sharing, Snowflake Marketplace, and cost governance strategies.
dbt Labs
dbt Fundamentals
Core dbt concepts: models, tests, documentation, sources, and deployment workflows.
HackerRank
SQL (Intermediate)
Complex joins, subqueries, window functions, and query optimization techniques.
2023 — Present
ThirthaSoft, LLC
Data Engineer
  • Designed a scalable data ingestion factory processing 8,000+ daily historical records using Apache Airflow DAGs with fault tolerance
  • Built modular ELT pipelines (Python, YAML, Spark) for CSV/JSON/XML/REST APIs, reducing onboarding time by 60%
  • Deployed pipelines to Snowflake via CI/CD (GitHub Actions, Docker, AWS), improving deployment efficiency by 40%
  • Architected cloud-native data layers with partitioning, schema enforcement, and RBAC compliance
2022
ITHENA
Data Engineer Intern
  • Engineered secure ETL workflows (Python, AWS S3/Lambda/Athena) processing 100K+ daily IAM logs
  • Consolidated 500K+ RBAC records into a centralized AWS Lake Formation data lake
  • Identified 1,200+ access control issues, ensuring 100% audit compliance
  • Architected an RBAC data mart for 15+ business units, reducing role conflicts by 30%
2019 — 2021
BrainyBeam Technologies
Data Engineer
  • Designed SQL queries, triggers, and stored procedures for ETL workflows managing 1M+ records
  • Optimized query performance by 30% through indexing strategies and execution plan analysis
  • Implemented data pipelines with Python, Talend, and SSIS for enterprise systems
  • Maintained star/snowflake schemas for Tableau and Power BI reporting layers
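
The indexing and execution-plan analysis mentioned in the BrainyBeam role can be illustrated with a small sketch. SQLite is used only as a stand-in because it ships with Python; the database engine used in that role is not stated here, and the `orders` table, its columns, and the index name are hypothetical. The sketch compares the plan reported for the same query before and after adding an index on the filtered column.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, total) VALUES (?, ?)",
    [(i % 100, float(i)) for i in range(1000)],
)


def plan(sql):
    """Return the human-readable detail column of SQLite's query plan."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]


query = "SELECT total FROM orders WHERE customer_id = 42"

# Without an index, the filter requires scanning the whole table.
before = plan(query)

# Index the column used in the WHERE clause, then check the plan again.
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
after = plan(query)

print("before:", before)
print("after: ", after)
```

The first plan should describe a table scan and the second a search that uses `idx_orders_customer`; the exact wording varies by SQLite version.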

Let's build something great.

Email
dpanchal.dp.2005@gmail.com
Phone
+1 407 494 1791
Location
New York, NY