Skip to content
View Ps-budd's full-sized avatar

Block or report Ps-budd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Ps-budd/README.md

Aditya Dubey — Data Engineer • Cloud Architect • AI Builder

Aditya Dubey

Data Engineer · Cloud Architect · AI Builder · Creator of mygol.ai

Google Cloud Certified — Professional Cloud Architect     Certified ScrumMaster (CSM) — Scrum Alliance     Tableau Desktop Specialist

Email LinkedIn GitHub Followers Resume profile views

Aditya Dubey Innovative Data Engineer with 5+ years building cloud‑native data platforms and privacy‑first APIs on GCP/AWS/Snowflake. Led cost and latency wins including $100K/month BigQuery savings and 87% delivery‑SLA reduction. I ship observable, scalable systems and pragmatic LLM/RAG features with guardrails.


🚀 What I’m focused on

  • Cloud‑native data platforms on GCP (Cloud Run, Dataflow, Pub/Sub, BigQuery, dbt, Airflow, Spark)
  • Privacy‑first APIs (GDPR/CCPA), observability with OpenTelemetry, and async event pipelines
  • AI features: LLM/RAG, evals and hallucination mitigation for production reliability
  • Now building: mygol.ai — an automated job‑apply engine on Cloud Run + Next.js

🏅 Certifications

Certified ScrumMaster    Google Cloud Professional Cloud Architect    Coursera — Stanford Machine Learning (Andrew Ng)


🌟 Highlighted repository


📚 Popular repositories


🧩 Featured builds

  • First‑Party Segment API (GCP) — Cloud Run + API Gateway + Pub/Sub/Dataflow; OpenTelemetry tracing; opt‑out & expiration endpoints.
    Stack: GCP · Cloud Run · API Gateway · BigQuery · OpenTelemetry

  • CRM Delivery Platform — PII encryption/salting; SFTP/GCS/S3 chunked delivery with retries; 2 days → 6 hours.

  • LLM Audience Segmenter — NL → audience segments; 90% manual effort ↓; 85% launch time ↓; guardrails + evals.
    Stack: FastAPI · OpenAI/Gemini · Guardrails · Evals

  • BigQuery Cost Optimizer — stored procedures + usage analyzer; partition/prune; ~$100K/month saved.


🔧 Toolbox

BigQuery Snowflake Redshift GCS S3 Cloud Run Dataflow Pub/Sub Airflow dbt Spark Kafka Python Go TypeScript FastAPI React Next.js Docker Terraform GitHub Actions OpenTelemetry


📈 GitHub at a glance

GitHub stats Top languages


📬 Get in touch

Always happy to chat about data platforms, cloud, and productionizing AI.

Popular repositories Loading

  1. Traffic-Bolt Traffic-Bolt Public

    A analytical pipeline that provide data driven insights on city traffic.

    Python 3

  2. Building-Big-Data-Pipelines-with-PySpark-flask-MongoDB-Bokeh Building-Big-Data-Pipelines-with-PySpark-flask-MongoDB-Bokeh Public

    Data Preprocessing using Python+Flask• Data Visualization- geo-map plot, bar chart, magnitude plot using Bokeh lib. • Machine learning using Pyspark & Mllib to build Predictive models. • Creating t…

    Python 1 9

  3. -Crop-Yield-Prediction- -Crop-Yield-Prediction- Public

    • Prediction of crop yield for upcoming 5 years based on historical data using Python and data mining Techniques. • Build a recommender system for seasonal crops using collaborative filtering. • Pr…

    Jupyter Notebook 1 1

  4. Attendance_app Attendance_app Public

    Student attendance management system is concerned with managing the attendance data of the student. This enhances student attendance based on class participation. It is preserved on their presence …

    Java

  5. COVID-19-data-analysis-and-Dashboards COVID-19-data-analysis-and-Dashboards Public

    • Covid-19 Analysis using Power Bi: • Created Live Dashboards and visualizations of processed data to identify trends for Covid-19 analysis. • Build Geo map of Covid-19 confirmed, recovered & death…

  6. -ROSSMAN-SALES-FORECAST- -ROSSMAN-SALES-FORECAST- Public

    KNN classifier, Naive Bayes classifier, Random forest, Hypothesis testing, Anova, R studio, Python. • Predicted future sales using random forest. • Provide the Business decision on promos using Hyp…

    R