Skip to content
View tvoosa08's full-sized avatar

Highlights

  • Pro

Block or report tvoosa08

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tvoosa08/README.md

πŸ‘‹ Hi, I'm Teja Voosa

Data Engineer | AWS & GCP Certified | AdTech & Big Data Specialist

Hyderabad, Telangana, India
LinkedIn: linkedin.com/in/tejavoosa
Email: tejavoosa427@gmail.com / tvoosa@gmail.com


πŸš€ About Me

I'm a passionate Data Engineer with hands-on experience designing and building scalable, cost-efficient data pipelines for large-scale AdTech and media applications. At LTIMindtree, I build robust solutions using AWS, GCP, Python, PySpark, Sql and Airflow, driving efficiency and innovation in cloud data workflows. I love tackling big data challenges, modernizing legacy systems, and optimizing cloud costs.


πŸ› οΈ Skills & Tools

aws docker gcp git hadoop hive kafka kubernetes linux mssql mysql postgresql python

  • Cloud: AWS (Redshift, MWAA, Lambda, EMR, S3), GCP (BigQuery, Composer, Cloud Functions, Dataproc)
  • Data Engineering: Airflow, ETL/ELT, Data Migration, Pipeline Optimization
  • Programming: Python, PySpark, SQL
  • Other: Hadoop, Data Quality, Performance Tuning, Cross-functional Collaboration

πŸ† Certifications

  • Google Cloud Certified Professional Data Engineer (Jun 2024 – Jul 2026)
  • Google Certified Associate Cloud Engineer (Nov 2023 – Nov 2026)
  • AWS Certified Developer Associate (Apr 2023 – Apr 2026)
  • AWS Certified Cloud Practitioner (Jan 2023 – Jan 2026)

πŸ’Ό Experience

Data Engineer, LTIMindtree
Jan 2022 – Present | Hyderabad, India

  • Developed scalable data pipelines using Airflow, AWS, and GCP for a leading AdTech client, boosting efficiency by 30%.
  • Built robust ETL workflows for large, diverse datasets across BigQuery, AWS EMR, S3, and Hadoop.
  • Led performance tuning and optimization, reducing processing time and cloud costs.
  • Automated data quality, archiving, and deletion processes for improved accuracy and storage efficiency.
  • Collaborated with data science and engineering teams to deploy solutions at scale.

🌟 Featured Projects

AdTech Data Engineering & Data Science (Mar 2022 – Present)

Tech: Python, PySpark, SQL, Airflow, AWS, GCP

  • Developed and optimized pipelines for CTV/OTT advertising, leveraging identity-matching tech for audience unification.
  • Designed, monitored & optimized Airflow DAGs for high availability and cost efficiency across AWS and GCP.
  • Built custom Airflow operators (e.g., automated table removal) saving ~$10K/day in cloud costs.
  • Led cloud storage migration, unifying regions and slashing expenses.
  • Contributed to next-gen Graph2 (IP & Device Graph) pipeline, replacing legacy systems.
  • Automated dataset deletion across BQ, GCS, and S3 for major savings.
  • Modernized workflows (Python 2β†’3, Linux to Airflow) for better scalability and cost.
  • Partnered with Data Science on audience segmentation, targeting, and analytics enhancements.

πŸ“š Education

MTECH in Software Engineering
Bits-Pilani Hyderabad
Mar 2022 - Mar 2026

B.Sc. in Computer Science
Dr. B.R. Ambedkar University, Srikakulam
Sep 2017 – May 2021


πŸ“« Let's Connect!


Popular repositories Loading

  1. storageflow storageflow Public

    Python 1

  2. DSA DSA Public

  3. tvoosa08 tvoosa08 Public

  4. claude-code claude-code Public

    Forked from yasasbanukaofficial/claude-code

    πŸš€ Open source Claude Code CLI source code. Advanced AI Agent for developers. Includes TypeScript codebase for LLM tool-calling, agentic workflows, and terminal UI. Remember this is just the skeleto…

    TypeScript