Skip to content
View ashuhimself's full-sized avatar
🌏
Available
🌏
Available

Highlights

  • Pro

Block or report ashuhimself

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ashuhimself/README.md

header

Typing SVG

πŸš€ Transforming Raw Data into Actionable Insights

Passionate about building robust data infrastructure that scales

LinkedIn Email GitHub

Profile Views


πŸ§‘β€πŸ’» About Me

class DataEngineer:
    def __init__(self):
        self.name = "Ashutosh Tiwari"
        self.role = "Senior Data Engineer"
        self.experience = "4+ years"
        self.location = "India"

    def get_skills(self):
        return {
            "languages": ["Python", "SQL", "Bash"],
            "big_data": ["Apache Spark", "Apache Kafka", "Airflow"],
            "databases": ["PostgreSQL", "BigQuery", "Redshift", "Elasticsearch"],
            "cloud": ["AWS", "Azure", "GCP"],
            "devops": ["Docker", "Kubernetes", "Terraform", "Jenkins"],
            "specialties": ["ETL/ELT", "Data Pipelines", "Real-time Analytics"]
        }

    def current_focus(self):
        return ["Real-time Data Streaming", "MLOps", "Data Lake Architecture"]

πŸ› οΈ Tech Stack & Skills

Programming & Core Technologies

Big Data & Analytics

apache spark kafka airflow elasticsearch databricks

Cloud Platforms

DevOps & Infrastructure

Data Warehouses & Analytics

bigquery redshift clickhouse dbt

Currently Learning

huggingface tensorflow pytorch

πŸ’Ό Professional Highlights

πŸ—οΈ Architecture & Design

  • Designed scalable data pipelines processing 10M+ records daily
  • Built fault-tolerant systems with 99.9% uptime SLA
  • Implemented multi-cloud data lake architectures

⚑ Performance & Optimization

  • Optimized ETL processes reducing execution time by 70%
  • Achieved 40% cost reduction in cloud infrastructure
  • Streamlined data workflows with automated monitoring

πŸ”„ Real-time & Streaming

  • Built real-time data solutions with Kafka & Spark Streaming
  • Implemented event-driven architectures
  • Created low-latency analytics dashboards

☁️ Multi-Cloud Expertise

  • AWS: S3, Glue, Redshift, Lambda, EMR, Kinesis
  • Azure: Data Factory, Synapse, Databricks, Event Hubs
  • GCP: BigQuery, Dataflow, Pub/Sub, Dataproc

πŸ“Š GitHub Analytics

GitHub Stats Top Languages
GitHub Streak
Contribution Graph

🌟 Current Focus

πŸ”₯ Building: Next-generation real-time analytics platforms with AI integration πŸ“š Learning: AI/ML Transformers, RAG (Retrieval-Augmented Generation), MCP protocols πŸš€ Exploring: Advanced Airflow patterns and AI-powered data pipelines 🀝 Open to: Exciting data engineering and AI opportunities

🀝 Connect with me!


Employer?

Important

πŸš€ 4+ Years Data Engineering Experience | Multi-Cloud Expert | Open to Opportunities

Ready to build your next-generation data infrastructure! Let's connect and discuss how I can drive your data initiatives forward.

πŸ’‘ "Turning data into insights, one pipeline at a time"

⭐ **Star this repo if you find it interesting!** ⭐

footer

Popular repositories Loading

  1. mlops mlops Public

    The Complete End-to-End Machine Learning Operations Ecosystem

    Python 4 2

  2. ashuhimself ashuhimself Public

    1

  3. data-platform-iac data-platform-iac Public

    A fully open-source, Infrastructure-as-Code implementation of a modern data platform. This repository provisions and deploys a complete data stack on AWS EC2 virtual machines with no managed servic…

    HCL 1 1

  4. airspark airspark Public

    This project demonstrates how to set up Apache Airflow with Apache Spark using Docker. It provides a seamless way to manage and execute Spark jobs within Airflow DAGs. By leveraging Docker and Astr…

    Python

  5. Airflow-dag-repo-scanner Airflow-dag-repo-scanner Public

    Python

  6. ai-schema ai-schema Public

    Python