-
TATA technologies ltd.
- Pune
- http://www.linkedin.com/in/SwapnilTaware
Pinned Loading
-
Milvus-vector-database-project
Milvus-vector-database-project PublicThis project asynchronously scrapes web content, generates semantic text chunks using sentence embeddings, and stores them in a Milvus vector database for efficient similarity search. Built with Py…
Python
-
databricks-fmcg-medallion-architecture-delta-lake
databricks-fmcg-medallion-architecture-delta-lake PublicEnd-to-end FMCG data engineering project on Databricks using Medallion Architecture and Delta Lake. Ingests CSV data from Volumes/S3, processes Bronze–Silver–Gold layers with PySpark, supports incr…
Jupyter Notebook
-
Data-Interpolation-with-Radial-Basis-Function
Data-Interpolation-with-Radial-Basis-Function PublicA PySpark-based solution for cleaning and interpolating battery sensor data using forward/backward fill and Radial Basis Function (RBF) spatial interpolation. Outputs a clean, fully interpolated da…
Python
-
Python-to-ELK-data-pipeline-project
Python-to-ELK-data-pipeline-project PublicA Python-based ETL pipeline that extracts data from an Oracle database using SQL, transforms it into a structured format, and indexes it into Elasticsearch for analytics and reporting.
Python
-
Pyspark_data_pipeline_with_Airflow_orchastration
Pyspark_data_pipeline_with_Airflow_orchastration PublicThis repository contains an Airflow DAG that orchestrates an incremental data pipeline using PySpark scripts. The pipeline automates daily processing data, syncs results to S3, performs housekeepin…
Python
-
Data-Analyst-Challenge
Data-Analyst-Challenge PublicA comprehensive data analytics project showcasing data ingestion, cleaning, exploratory data analysis (EDA), statistical evaluation, and insightful visualizations using Jupyter Notebook. Designed t…
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.