// hi, I'm

Varshitha Gogineni

AI Engineer / Data Engineer

I build agentic AI systems, real-time voice agents, and LLM-powered tool-calling pipelines — with a strong data-engineering foundation backing every model. Recognized at Dallas AI 2025 (Top 14 of 32). Recently graduated from the University of North Texas with an MS in Information Science (4.00 / 4.00 GPA, 2026).

01 / about

A bit about me

I'm an AI Engineer focused on shipping agentic AI, real-time voice agents, and LLM-powered backends. I've built tool-calling pipelines that handle 100+ live scenarios with strict JSON-schema validation, voice workflows that autonomously close out 70–80% of inbound calls, and RAG assistants that hold context across multi-turn sessions.

My core stack is Python, LangChain, Gemini / OpenAI APIs, Whisper STT, and MCP — wired up with Twilio and n8n for production workflows.

Backed by a strong data-engineering foundation — BigQuery, PySpark, Airflow, dbt, Pub/Sub on GCP — so the models I ship sit on top of pipelines that are observable and reliable.

Varshitha Gogineni at her University of North Texas graduation
MS, Information Science

University of North Texas · Class of 2026

GPA 4.00 / 4.00

Dallas AI 2025

Top 14 of 32 teams

VoiceFit

Denton, TX

Open to AI / Data Engineering roles

Download my résumé

PDF · AI Engineer · updated 2026

02 / experience

Where I've worked

AI Engineer Intern @ Cambo Box

United States, Remote

Aug 2025 — Dec 2025

  • Designed a production-ready agentic AI voice workflow for restaurants, autonomously handling 70–80% of inbound calls and automating outbound reservation reminders.
  • Integrated AI voice agents with Twilio telephony supporting real-time barge-in and seamless escalation, improving call-handling efficiency by ~40%.
  • Architected tool-calling pipelines for 100+ reservation scenarios with strict JSON schema validation, reducing invalid AI tool calls by ~90%.
  • Built REST API orchestration and cron-driven outbound workflows for sub-second response and zero-touch daily reservation confirmations.
PythonTwilioAgentic AITool CallingREST APIs

IT Services Team Lead & Web Developer @ University of North Texas — CMHT IT Services

Denton, TX

Dec 2024 — May 2026

  • Promoted from classroom technology assistant to Team Lead, managing student assistants across 180+ managed laptops for students and CMHT faculty.
  • Develop API-driven web applications and internal data tooling for 5+ academic departments, cutting response time on key endpoints by ~30%.
  • Built ETL utilities and n8n automations to auto-generate reports and route form submissions — eliminating ~10 hours of manual work per week.
  • Maintained team SOPs and IT documentation; provided hardware/software troubleshooting across CMHT.
JavaScriptn8nETLREST APIsSOPs

03 / projects

Selected work

AI Engineering

3 projects

Agentic AI · Playwright MCP

Self-Healing Browser Automation Agent

Carrier-agnostic agentic loop that completes an 8-page workers' compensation insurance quote end-to-end with zero carrier-specific code — an OpenAI reasoning model drives a Playwright MCP server controlling a real Chromium browser via function calling.

  • Engineered a self-healing core that auto-captures the live page snapshot on any tool failure and reinjects it, letting the model recover from complex Angular/PrimeNG widgets in real time without hard-coded selectors.
  • Delivered a validated live submission ($539 instant quote, 0 errors, ~9-minute run); added recipe record-and-replay so routine runs cost cents — cutting projected cost at scale by ~10x.
  • Designed a modular TypeScript architecture (agent loop, MCP client, per-carrier mapping, CLI) with token/cost tracking — adding a new carrier requires only a single new file.
Playwright MCPOpenAITypeScriptNode.jsAgentic AI
$539 live quote · ~10x cheaper

LLM Voice Assistant · RAG

CMHT Voice Agent

Real-time LLM voice assistant for UNT's College of Merchandising, Hospitality and Tourism — helps students and faculty locate rooms and provides front-desk help.

  • Built with Gemini + LangChain, modular tool-calling, and session memory for context-aware multi-turn workflows.
  • Optimized API latency and token usage via prompt caching, batched requests, and efficient context window management.
Gemini APIsLangChainPythonRAG
speech-to-speech

Dallas AI 2025 · Top 14

VoiceFit — AI Voice Assistant

Hands-free AI fitness companion using Whisper STT, React, n8n, and OpenAI. Recognized as Top 14 of 32 teams at Dallas AI 2025.

  • Built a voice-enabled AI assistant with Whisper STT and a responsive React front-end.
  • Designed automated task workflows in n8n reducing manual interaction steps by 60%+.
Whisper STTReactn8nOpenAISupabase
live demoOpen →

Data Engineering

2 projects

Cloud-Native Pipeline · GCP

Healthcare Data Lifecycle

End-to-end GCP data pipeline processing 1,048,575 CDC BRFSS healthcare survey records (2020–2024) into real-time informatics insights.

  • Engineered scalable ETL, batch, and streaming workflows with Cloud Storage, BigQuery, and Spark + Hive on Dataproc.
  • Reduced health-alert delivery latency from minutes to under 5s via Pub/Sub streaming into BigQuery.
  • Delivered Power BI dashboards across 27 BRFSS variables for near-real-time public health monitoring.
BigQueryPySparkHiveDataprocPub/SubPower BI
1.04M records

Batch Data Engineering

NYC Taxi Trips Pipeline

End-to-end batch pipeline ingesting 10M+ NYC Yellow Taxi trips into BigQuery with bronze/silver/gold medallion layering and dbt-modeled analytics.

  • Orchestrated ingestion, transforms, and Great Expectations data quality checks as Apache Airflow DAGs with retries.
  • Published metrics in a Looker Studio dashboard for at-a-glance KPI monitoring.
BigQueryApache AirflowdbtGreat ExpectationsPython
10M+ trips

04 / skills

Tools I build with

AI Engineering

Agentic AI & LLMs

RAGAgentic AITool CallingLangChainGemini APIsOpenAI APIsModel Context ProtocolPrompt CachingSession Memory

Voice & Real-Time AI

Whisper STTPipecatTwilioReal-time barge-inTTS / STTStreaming responses

ML, NLP & Automation

NLPCNNsInformation Retrievaln8nAzure AI

Languages

PythonTypeScriptJavaScriptSQLPySparkHiveQL
Data Engineering Foundation

Data Engineering & Big Data

BigQueryApache SparkHiveDataprocApache AirflowdbtGreat ExpectationsPub/SubCloud StorageETL/ELTPower BILooker Studio

Cloud, Databases & Tooling

GCPDockerGit / GitHubPostgreSQLMongoDBMySQLFirebasePinecone Vector DB

05 / wins

Achievements & certifications

Highlight

Dallas AI 2025 — Top 14 of 32 teams

Recognized for VoiceFit, an AI-powered hands-free fitness companion (OpenAI Whisper, n8n, Supabase). Live at voicefit.vercel.app.

Highlight

Y Combinator · Gemini × Pipecat Hackathon

Participated at YC during SF Tech Week 2025 — co-built an AI-powered Mock Interview Platform with Gemini Live and Pipecat, enabling real-time voice, screen-share, and AI code review.

Highlight

CMHT Voice Assistant — UNT

Built a speech-to-speech AI agent for UNT's College of Merchandising, Hospitality and Tourism that helps students and faculty locate rooms and provides front-desk assistance.

Certifications · Apr 2026

  • Anthropic — Introduction to Agent Skills
  • Anthropic — Introduction to Model Context Protocol (MCP)
  • Anthropic — Building with the Claude API
  • Anthropic — Claude Code in Action

06 / contact

Let's talk

Looking to collaborate on agentic AI, voice agents, or cloud data pipelines? Drop your name and email and I'll get back to you.

Based in Denton, TX · open to remote / hybrid.