Your Name

Education

Ramniranjan Jhunjhunwala College of Science and Commerce [University of Mumbai] - May 2022

Bachelor of Science (B.Sc.) in Statistics with C.G.P.A 9.6.

Projects

Data Engineering Pipeline with Open-Source Tools

  • Technologies: Airbyte, MinIO S3, dbt, OpenMetadata, Metabase, Docker
  • Built a scalable, automated data pipeline using Airbyte for data extraction, MinIO S3 for storage, and PostgreSQL for warehousing.
  • Transformed data into analytical models with dbt and implemented OpenMetadata for governance and data lineage.
  • Delivered actionable insights through Metabase visualizations; containerized the entire solution with Docker for scalability and portability.

Real-Time Election Voting System

  • Technologies: PostgreSQL, Confluent Kafka, Apache Spark, Streamlit, Docker
  • Developed a real-time voting simulation system integrating voter demographics via an API into a PostgreSQL database.
  • Streamed votes using Kafka and aggregated them efficiently with Spark.
  • Visualized live results through an interactive Streamlit dashboard.
  • Containerized the system with Docker for scalability and seamless deployment.

Skills

  • Programming Languages: Python, SQL (Intermediate)
  • Source Control: Git, GitHub (Beginner)
  • Big Data Technologies: Spark, Kafka (Beginner)
  • Cloud Platforms: Azure (Intermediate), AWS (S3, Redshift) (Beginner)
  • Data Visualization: Power BI, Tableau (Intermediate)
  • Containerization: Docker, Kubernetes (Beginner)
  • Data Integration and Orchestration: Airbyte, Airflow (Beginner)
  • Core Competencies: CI/CD concepts (Beginner), Data Analysis, Data Modeling, ETL Processes (Intermediate)
  • CI/CD Tools: Jenkins (Beginner)