Senior Software & AI Engineer

Rajakoti Dasari

Python · Java · .NET · iOS/Mobile · Full Stack · Cloud · Generative AI

Senior engineer with 5+ years of professional experience delivering production systems across software engineering, full stack development, mobile applications, and AI. Fluent in Python, Java, and .NET — I build things that scale and ship AI that works in the real world.

Open to remote contract & full-time roles  ·  US Citizen  ·  Available nationwide

AI / LLM Engineer Generative AI Engineer Senior Software Engineer Backend Engineer Full Stack Engineer iOS & Mobile Developer
[email protected] GitHub LinkedIn 📍 Austin, TX — US Citizen
5+
Years experience
8+
Projects shipped
1
US Patent
2
Publications
About

Who I am

RDAdd your photo

I'm a Senior Software & AI Engineer based in Austin, TX with 5+ years of professional experience. I'm fluent across Python, Java, and .NET — I've shipped production code in all three and step into any of these stacks from day one.

My work spans financial services, healthcare, insurance, and enterprise software — backend services handling millions of transactions, microservices at 99.9% uptime, LLM integrations in production, full-stack and mobile applications, and high-volume data pipelines.

I believe the best engineers today sit at the intersection of solid software engineering and AI — I bring both. Clean architecture and strong fundamentals on one side; production-ready generative AI and agentic systems on the other.

Also: Named author on two peer-reviewed publications (IEEE and US technical press) and a contributor on a granted US patent in sensor analytics and health monitoring application development — details available on request.
PG — Machine Learning & Artificial IntelligenceUniversity of Texas at Austin
Master's in EngineeringIndian Institute of Science (IISc), India
Teaching & Mentorship Part-time · Weekends only Mar 2023 — Oct 2024
Teaching Assistant & Mentor
University of Texas at Austin — AI & Machine Learning Certification Program

Mentored working professionals through UT Austin's AI & ML certification — covering Data Science, Machine Learning, and AI fundamentals. Supported learners with project reviews, concept clarification, and Q&A sessions. Delivered entirely on weekends (Saturdays & Sundays, ~4 hrs/day) while working full-time as a Software Engineer.

Skills

Technologies I know well

Built up over 5+ years of shipping real products across different stacks and domains.

🤖
LangChain / LlamaIndex
AI · LLMs
RAG Architecture
AI · LLMs
🧠
Agentic AI — CrewAI, AutoGen
AI · LLMs
OpenAI / Anthropic / Gemini
AI · LLMs
🔬
Prompt Engineering
AI · LLMs
🤗
Hugging Face / Transformers
AI · ML
📊
TensorFlow / PyTorch
AI · ML
🔍
Embeddings / Semantic Search
AI · LLMs · Search
🛠
OpenAI Function Calling / Tool Use
AI · LLMs · Agents
📡
LangSmith / LLM Observability
AI · MLOps · Monitoring
AWS Bedrock
AI · Cloud · LLMs
Java / Spring Boot
Backend · Engineering
🐍
Python / Flask / FastAPI
Backend · Engineering
🔷
.NET / C# / ASP.NET Core
Backend · Engineering
Microservices / Distributed Systems
Backend · Architecture
🔗
REST APIs / GraphQL
Backend · API Design
🔐
OAuth2 / JWT Authentication
Backend · Security · APIs
🧪
Unit Testing — Pytest / JUnit
Backend · Quality · TDD
🐙
Git / GitHub / Version Control
Backend · DevOps · Collaboration
📋
AML / KYC Compliance Systems
Backend · Fintech · Compliance
🏥
HIPAA Compliant Systems
Backend · Healthcare · Security
React.js / Next.js
Full Stack · Frontend
🟩
Node.js / Express.js
Full Stack · Backend
📘
TypeScript / JavaScript
Full Stack · Programming
📱
React Native
Mobile · iOS
🍎
iOS App Development
Mobile · iOS
🔔
Push Notifications / Real-time
Mobile · Cloud
AWS (EC2, S3, Lambda, RDS, Bedrock)
Cloud · Infrastructure · Serverless
🔵
Azure / Azure OpenAI Service
Cloud · DevOps
🐳
Docker / Kubernetes
Cloud · DevOps
🔄
CI/CD — Jenkins, GitHub Actions
DevOps · Automation
🔁
Cloud Data Migration — OCI to AWS S3
Cloud · Infrastructure · Shell Scripting
🌐
Cloud Architecture & Design
AWS · Azure · Multi-cloud · Scalability
Terraform / Infrastructure as Code
Cloud · DevOps · Automation
Redis / Caching Strategies
Cloud · Backend · Performance
Apache Spark / Kafka
Data · Engineering
📈
Power BI / SSRS
Data · Analytics
📊
Tableau
Data · BI · Visualization
🗄
PostgreSQL / SQL Server / DB2
Data · Databases
🔎
Vector DB — Pinecone, Chroma
AI · Data
🔀
ETL Pipelines / SSIS
Data · Engineering
🔁
End-to-End Data Pipelines
Airflow · Spark · Snowflake · dbt
🔧
dbt (Data Build Tool)
Data · Transformation · Analytics Engineering
🍃
MongoDB / Redis / DynamoDB
Data · Databases
SQL Optimization / Query Tuning
Data · Backend · Performance
📨
RabbitMQ / Message Queues
Backend · Messaging · Architecture
Projects

Things I've built

Production work across AI, backend, full stack, mobile, and data engineering. Client names withheld per confidentiality agreements.

AML Monitoring Platform
AI / ML · Fintech

ML risk scoring with open sanctions data for real-time AML compliance screening. Replaced a fully manual review process for a fintech client.

Python MLMarbleOpenSanctionsPostgreSQLDockerAWS
AI RAG Knowledge Assistant
Generative AI

Enterprise RAG with vector indexing and NL Q&A across 10,000+ documents. 35% better retrieval; 50K+ queries/month in production.

LangChainLlamaIndexVector DBOpenAIPythonAWS
Autonomous AI Agent
Agentic AI

Production multi-agent system with planning, tool-calling, and autonomous execution. Reduced operational overhead 40% for a Series A startup.

LangChainCrewAIAutoGPTPythonFastAPIAWS Lambda
Voice-to-Text AI Agent Platform
Agentic AI · Voice · Real-time

End-to-end multi-agent AI platform with 8 specialized agents working in coordination — from voice input to intelligent response via a live agent interface. Built for real-world production use with full AI infrastructure design and optimization.

  • Designed and built 8 coordinated AI agents handling voice-to-text, intent classification, task routing, execution, and response generation
  • Built live agent interface — real-time voice input processed end-to-end through the agent pipeline
  • Designed complete AI infrastructure: model serving, agent orchestration, latency optimization, and scalable deployment
  • Client details confidential — enterprise production deployment
Multi-Agent Systems Voice-to-Text LangChain Real-time AI Python FastAPI AWS AI Infrastructure
Education & Training Platform
iOS · Mobile · Full Stack

Instructor + student dual apps with AI feedback analytics, cloud recording, and real-time communication for 100+ students per session.

  • AI/ML sentiment analysis on student feedback; automated quality scoring
  • Web application built alongside both mobile apps
iOS / MobileReact NativeAI/MLCloud StorageReal-time
Insurance & Fraud Detection
Fintech · Insurance · Java · .NET

Auto, Travel, and Healthcare insurance apps plus real-time AML fraud detection monitoring millions of banking transactions — 24/7 global compliance monitoring.

Java.NETApache SparkSQLSSISSSRS
Academic Publishing Platform
Enterprise · Backend · Java

Large-scale academic publishing with global distribution, multi-level approval workflows, and order processing at millions of records. Resolved critical production incident under high load.

JavaSQLDB2OracleWeb Technologies
BI & Analytics Dashboard
Data / BI

End-to-end data pipelines and Power BI dashboards replacing manual weekly executive reports — multi-source ingestion and Python/SQL transformations.

Power BIPythonSQLApache SparkAWS Redshift
Data Processing Framework
Data Engineering

Reusable enterprise framework — millions of records/second with pluggable ingestion, transformation, and output pipelines. Adopted across multiple internal projects.

PythonApache SparkETLDistributed SystemsSQL
Data API Platform for Analytics
Backend · APIs · Python

Designed and built a scalable FastAPI backend platform delivering analytics datasets to internal applications and dashboards. Integrated with a centralized data warehouse exposing secure REST APIs for high-volume data retrieval.

  • API authentication, query endpoints, and performance optimization for high-volume requests
  • Architecture: Data Warehouse → FastAPI Backend → REST APIs → Applications & Dashboards
PythonFastAPIREST APIPostgreSQLBackend Development
Modern Data Pipeline Architecture
Data Engineering · Snowflake · Power BI

Designed and implemented a scalable end-to-end data pipeline collecting data from external APIs, processing large datasets with Apache Spark, and loading into Snowflake for Power BI analytics dashboards.

  • Apache Airflow orchestrates pipeline workflows; Spark processes large-scale datasets
  • Architecture: API Sources → Airflow → Spark → Snowflake → Power BI
Apache SparkApache AirflowSnowflakePower BIPython
Real-Time Streaming Data Platform
Data Engineering · Kafka · Streaming

Built a real-time data streaming platform processing high-volume event data continuously using Apache Kafka and Spark Structured Streaming, storing results in Databricks Delta Lake for analytics and monitoring.

  • Continuous event processing pipeline handling high-throughput real-time data streams
  • Architecture: Event Producers → Kafka → Spark Streaming → Databricks Delta Lake
Apache KafkaSpark StreamingDatabricksPythonDelta Lake
Open Source

GitHub Repositories

14 public repositories across AI engineering, full stack, backend APIs, Java, .NET, and data engineering. Every project ships real, working code.

📄DocMind AI

Real-Time GenAI Document Q&A Platform using LangChain, RAG, and LLMs for intelligent document understanding.

PythonLangChainRAGGenAI
🤖ai-pdf-chatbot-langchain-rag

AI PDF chatbot using LangChain and RAG architecture for intelligent document querying with OpenAI.

PythonLangChainRAGOpenAI
🧠AI-retrieval-agent-starter

Starter kit for building autonomous AI retrieval agents with tool-calling, planning, and reasoning.

PythonLangChainAgentsFastAPI
SignalOps

Real-Time Streaming Data Platform — Apache Kafka, Spark Structured Streaming, Databricks Delta Lake.

KafkaSparkDatabricksPython
🔀StreamForge

End-to-End Data Engineering Platform — Airflow orchestration, Spark processing, Snowflake warehouse.

AirflowSparkSnowflakePython
🗄WarehouseForge

Enterprise data warehouse architecture with ETL pipelines, Power BI dashboards, and analytics reporting.

SnowflakeETLPower BISQL
🔷ShopSphere

Modern ASP.NET Core Full Stack Reference Application — .NET, C#, REST APIs, full stack architecture.

.NETC#ASP.NET CoreFull Stack
CareTrack

Full Stack Java Application with Spring Boot — healthcare domain, REST APIs, database integration.

JavaSpring BootREST APIMySQL
fastapi-react-admin-boilerplate

Full stack boilerplate — FastAPI Python backend with React admin dashboard, ready to deploy.

FastAPIReactPythonPostgreSQL
🔐fastapi-postgres-auth-backend

Production-ready FastAPI + PostgreSQL authentication backend with JWT tokens and role-based access.

FastAPIPythonPostgreSQLJWT
🛍nextjs-shopify-storefront

Modern Next.js Shopify storefront — full stack e-commerce with TypeScript and Shopify Storefront API.

Next.jsTypeScriptShopifyReact
📝fastapi-blog-api-realworld

RealWorld spec blog API built with FastAPI — production-grade REST API with full CRUD and auth.

FastAPIPythonREST APIPostgreSQL
🏥springboot-clinic-management

Clinic management system built with Spring Boot — Java backend, appointment scheduling, REST APIs.

JavaSpring BootMySQLREST API
📋medium-clone-api-spec

Medium clone API specification — REST API design with full CRUD, auth, and social features.

REST APIAPI DesignBackend
View all repositories → github.com/rajakotid007
Research & IP

Publications & Patent

Granted Patent US Patent
Sensor Analytics & Monitoring Device — Intelligent Data Acquisition System
Named contributor · worked with patent attorney through full USPTO filing & prosecution
Patent details withheld per confidentiality agreement — available upon request
Granted & Published
Published Paper IEEE
Sensor Monitoring Systems & Intelligent Data Acquisition
Rajakoti Dasari, Prabhat Jain, Subhas Sarkar
Full citation available upon request
Peer-reviewed
Published Paper US Technical Magazine
ML-Driven Predictive Monitoring for IoT Sensor Networks
Rajakoti Dasari, Prabhat Jain, Subhas Sarkar
Full citation available upon request
Peer-reviewed
Blog

Writing

Thoughts on software engineering, AI systems, and building things that last.

🔍
📅 March 2025⏱ 6 min read
Why RAG is Still the Right Approach for Enterprise AI in 2025

Everyone is racing to fine-tune their own models. But after building RAG systems serving 50K+ queries a month in production, retrieval-augmented generation remains the most practical path for enterprise AI.

AI EngineeringRAGLLMs
Read More →
🤖
📅 February 2025⏱ 8 min read
Building Production AI Agents: What Nobody Tells You

AI agents look simple in demos. But shipping an autonomous agent to production — where it runs real workflows for real users — is an entirely different challenge. Here's what I learned.

AI EngineeringAgentsLangChain
Read More →
AI Engineering
Why RAG is Still the Right Approach for Enterprise AI in 2025
📅 March 2025⏱ 6 min readBy Rajakoti Dasari

Everyone is racing to fine-tune their own models. But after building RAG systems that serve 50,000+ queries a month in production, I'd argue that Retrieval-Augmented Generation is still the most practical, cost-effective, and maintainable approach for most enterprise AI use cases.

The fine-tuning trap

Fine-tuning feels powerful. You give the model your data, it learns your domain, and suddenly it "knows" your company. But in practice, enterprise knowledge is never static. Product catalogs change weekly. Policies are updated quarterly. Fine-tuning a model on a snapshot of your data means you're constantly out of date — and re-training is expensive in both time and compute.

The question isn't "which approach is technically superior?" It's "which approach can my team actually maintain in production six months from now?"

What RAG actually solves

RAG separates two fundamentally different problems: reasoning (which the base LLM already does well) and knowledge retrieval (which needs to stay fresh). By keeping your documents in a vector database and retrieving relevant chunks at query time, you get:

  • Knowledge that's always up to date — update the index, not the model
  • Source citations — users can see exactly which document an answer came from
  • Controllable costs — you only pay for the tokens in the retrieved context
  • Debuggability — when something goes wrong, you can trace it to a specific retrieval step

The retrieval quality problem

The hardest part of building RAG in production isn't the LLM — it's getting retrieval right. Naive vector similarity search works in demos. In production, you need:

  • Hybrid search — combining dense vector search with sparse keyword matching (BM25)
  • Chunking strategy — how you split documents dramatically affects retrieval quality
  • Re-ranking — a cross-encoder re-ranker on the top-k results improved our accuracy by over 20%

Lessons from production

On the RAG Knowledge Assistant I built for an enterprise client — 10,000+ documents, 50K+ monthly queries — the improvements that mattered most weren't LLM-related at all. They were retrieval improvements: better chunking, metadata filters, and query rewriting. The bottleneck was always: are we giving it the right context?

If you're building enterprise AI today, start with RAG. Get retrieval right. Fine-tune later — only if the retrieval-based approach genuinely can't get you there.

AI Engineering
Building Production AI Agents: What Nobody Tells You
📅 February 2025⏱ 8 min readBy Rajakoti Dasari

AI agents look simple in demos. LangChain, a few tools, an LLM call, and the agent "reasons" its way through a task. But shipping an autonomous agent to production — where it runs real workflows for real users — is an entirely different challenge.

The demo-to-production gap is enormous

In a demo, the happy path works beautifully. In production, you face: ambiguous inputs, tool call failures, infinite reasoning loops, hallucinated tool parameters, and users who give the agent instructions it was never designed to handle.

When a REST API fails, you get an error code. When an agent fails, you get a confidently wrong answer delivered after 45 seconds of "thinking."

The three things that actually matter

1. Constrain the action space ruthlessly

The biggest mistake is giving agents too many tools. Every additional tool is another surface for the model to hallucinate incorrect usage. We went from 12 tools down to 5 — and reliability improved dramatically. Each tool should do exactly one thing and have an unmistakably clear description.

2. Build human-in-the-loop checkpoints

The agent we shipped has explicit checkpoints: before any action that modifies external state (sending emails, updating records, triggering downstream systems), it pauses and surfaces a confirmation step. Users trusted it far more once they understood it wasn't acting silently.

3. Observability is not optional

You cannot debug a reasoning trace by reading logs. We integrated LangSmith from day one — every agent run is traced: which tools were called, in what order, what the LLM was "thinking," where it got stuck. Tools like LangSmith, Langfuse, and Arize are table stakes for production agents.

The reliability framework that worked for us

  • Input validation — classify and sanitize user intent before the agent starts
  • Tool contracts — strict input/output schemas with Pydantic, validated before every tool call
  • Retry with backoff — transient failures are common; build retry logic into every tool
  • Fallback paths — if the agent can't complete confidently, it routes to a human
  • Cost guardrails — set hard limits on token consumption per run

Treat your agent like distributed systems — assume failure, build for observability, and earn trust incrementally.

Contact

Get in touch

Let's connect

Feel free to reach out — whether it's about a role, a collaboration, or just to connect.

Location
Austin, TX — US Citizen
Phone
+1 (323) 451-1479