Skip to content
View itsSwapnil's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report itsSwapnil

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Milvus-vector-database-project Milvus-vector-database-project Public

    This project asynchronously scrapes web content, generates semantic text chunks using sentence embeddings, and stores them in a Milvus vector database for efficient similarity search. Built with Py…

    Python

  2. databricks-fmcg-medallion-architecture-delta-lake databricks-fmcg-medallion-architecture-delta-lake Public

    End-to-end FMCG data engineering project on Databricks using Medallion Architecture and Delta Lake. Ingests CSV data from Volumes/S3, processes Bronze–Silver–Gold layers with PySpark, supports incr…

    Jupyter Notebook

  3. Data-Interpolation-with-Radial-Basis-Function Data-Interpolation-with-Radial-Basis-Function Public

    A PySpark-based solution for cleaning and interpolating battery sensor data using forward/backward fill and Radial Basis Function (RBF) spatial interpolation. Outputs a clean, fully interpolated da…

    Python

  4. Python-to-ELK-data-pipeline-project Python-to-ELK-data-pipeline-project Public

    A Python-based ETL pipeline that extracts data from an Oracle database using SQL, transforms it into a structured format, and indexes it into Elasticsearch for analytics and reporting.

    Python

  5. Pyspark_data_pipeline_with_Airflow_orchastration Pyspark_data_pipeline_with_Airflow_orchastration Public

    This repository contains an Airflow DAG that orchestrates an incremental data pipeline using PySpark scripts. The pipeline automates daily processing data, syncs results to S3, performs housekeepin…

    Python

  6. Data-Analyst-Challenge Data-Analyst-Challenge Public

    A comprehensive data analytics project showcasing data ingestion, cleaning, exploratory data analysis (EDA), statistical evaluation, and insightful visualizations using Jupyter Notebook. Designed t…

    Jupyter Notebook