Rajakoti Dasari

About

Who I am

RDAdd your photo

I'm a Senior Software & AI Engineer based in Austin, TX with 5+ years of professional experience. I'm fluent across Python, Java, and .NET — I've shipped production code in all three and step into any of these stacks from day one.

My work spans financial services, healthcare, insurance, and enterprise software — backend services handling millions of transactions, microservices at 99.9% uptime, LLM integrations in production, full-stack and mobile applications, and high-volume data pipelines.

I believe the best engineers today sit at the intersection of solid software engineering and AI — I bring both. Clean architecture and strong fundamentals on one side; production-ready generative AI and agentic systems on the other.

Also: Named author on two peer-reviewed publications (IEEE and US technical press) and a contributor on a granted US patent in sensor analytics and health monitoring application development — details available on request.

PG — Machine Learning & Artificial IntelligenceUniversity of Texas at Austin

Master's in EngineeringIndian Institute of Science (IISc), India

Teaching & Mentorship Part-time · Weekends only Mar 2023 — Oct 2024

Teaching Assistant & Mentor

University of Texas at Austin — AI & Machine Learning Certification Program

Mentored working professionals through UT Austin's AI & ML certification — covering Data Science, Machine Learning, and AI fundamentals. Supported learners with project reviews, concept clarification, and Q&A sessions. Delivered entirely on weekends (Saturdays & Sundays, ~4 hrs/day) while working full-time as a Software Engineer.

Skills

Technologies I know well

Built up over 5+ years of shipping real products across different stacks and domains.

🤖

LangChain / LlamaIndex

AI · LLMs

›

⚡

RAG Architecture

AI · LLMs

›

🧠

Agentic AI — CrewAI, AutoGen

AI · LLMs

›

✨

OpenAI / Anthropic / Gemini

AI · LLMs

›

🔬

Prompt Engineering

AI · LLMs

›

🤗

Hugging Face / Transformers

AI · ML

›

📊

TensorFlow / PyTorch

AI · ML

›

🔍

Embeddings / Semantic Search

AI · LLMs · Search

›

🛠

OpenAI Function Calling / Tool Use

AI · LLMs · Agents

›

📡

LangSmith / LLM Observability

AI · MLOps · Monitoring

›

☁

AWS Bedrock

AI · Cloud · LLMs

›

☕

Java / Spring Boot

Backend · Engineering

›

🐍

Python / Flask / FastAPI

Backend · Engineering

›

🔷

.NET / C# / ASP.NET Core

Backend · Engineering

›

⚙

Microservices / Distributed Systems

Backend · Architecture

›

🔗

REST APIs / GraphQL

Backend · API Design

›

🔐

OAuth2 / JWT Authentication

Backend · Security · APIs

›

🧪

Unit Testing — Pytest / JUnit

Backend · Quality · TDD

›

🐙

Git / GitHub / Version Control

Backend · DevOps · Collaboration

›

📋

AML / KYC Compliance Systems

Backend · Fintech · Compliance

›

🏥

HIPAA Compliant Systems

Backend · Healthcare · Security

›

⚛

React.js / Next.js

Full Stack · Frontend

›

🟩

Node.js / Express.js

Full Stack · Backend

›

📘

TypeScript / JavaScript

Full Stack · Programming

›

📱

React Native

Mobile · iOS

›

🍎

iOS App Development

Mobile · iOS

›

🔔

Push Notifications / Real-time

Mobile · Cloud

›

☁

AWS (EC2, S3, Lambda, RDS, Bedrock)

Cloud · Infrastructure · Serverless

›

🔵

Azure / Azure OpenAI Service

Cloud · DevOps

›

🐳

Docker / Kubernetes

Cloud · DevOps

›

🔄

CI/CD — Jenkins, GitHub Actions

DevOps · Automation

›

🔁

Cloud Data Migration — OCI to AWS S3

Cloud · Infrastructure · Shell Scripting

›

🌐

Cloud Architecture & Design

AWS · Azure · Multi-cloud · Scalability

›

⚙

Terraform / Infrastructure as Code

Cloud · DevOps · Automation

›

⚡

Redis / Caching Strategies

Cloud · Backend · Performance

›

⚡

Apache Spark / Kafka

Data · Engineering

›

📈

Power BI / SSRS

Data · Analytics

›

📊

Tableau

Data · BI · Visualization

›

🗄

PostgreSQL / SQL Server / DB2

Data · Databases

›

🔎

Vector DB — Pinecone, Chroma

AI · Data

›

🔀

ETL Pipelines / SSIS

Data · Engineering

›

🔁

End-to-End Data Pipelines

Airflow · Spark · Snowflake · dbt

›

🔧

dbt (Data Build Tool)

Data · Transformation · Analytics Engineering

›

🍃

MongoDB / Redis / DynamoDB

Data · Databases

›

⚡

SQL Optimization / Query Tuning

Data · Backend · Performance

›

📨

RabbitMQ / Message Queues

Backend · Messaging · Architecture

›

Projects

Things I've built

Production work across AI, backend, full stack, mobile, and data engineering. Client names withheld per confidentiality agreements.

AML Monitoring Platform

Code

AI / ML · Fintech

ML risk scoring with open sanctions data for real-time AML compliance screening. Replaced a fully manual review process for a fintech client.

Python MLMarbleOpenSanctionsPostgreSQLDockerAWS

AI RAG Knowledge Assistant

Code

Generative AI

Enterprise RAG with vector indexing and NL Q&A across 10,000+ documents. 35% better retrieval; 50K+ queries/month in production.

LangChainLlamaIndexVector DBOpenAIPythonAWS

Autonomous AI Agent

Code

Agentic AI

Production multi-agent system with planning, tool-calling, and autonomous execution. Reduced operational overhead 40% for a Series A startup.

LangChainCrewAIAutoGPTPythonFastAPIAWS Lambda

Voice-to-Text AI Agent Platform

Code

Agentic AI · Voice · Real-time

End-to-end multi-agent AI platform with 8 specialized agents working in coordination — from voice input to intelligent response via a live agent interface. Built for real-world production use with full AI infrastructure design and optimization.

Designed and built 8 coordinated AI agents handling voice-to-text, intent classification, task routing, execution, and response generation
Built live agent interface — real-time voice input processed end-to-end through the agent pipeline
Designed complete AI infrastructure: model serving, agent orchestration, latency optimization, and scalable deployment
Client details confidential — enterprise production deployment

Multi-Agent Systems Voice-to-Text LangChain Real-time AI Python FastAPI AWS AI Infrastructure

Education & Training Platform

Code

iOS · Mobile · Full Stack

Instructor + student dual apps with AI feedback analytics, cloud recording, and real-time communication for 100+ students per session.

AI/ML sentiment analysis on student feedback; automated quality scoring
Web application built alongside both mobile apps

iOS / MobileReact NativeAI/MLCloud StorageReal-time

Insurance & Fraud Detection

Code

Fintech · Insurance · Java · .NET

Auto, Travel, and Healthcare insurance apps plus real-time AML fraud detection monitoring millions of banking transactions — 24/7 global compliance monitoring.

Java.NETApache SparkSQLSSISSSRS

Academic Publishing Platform

Code

Enterprise · Backend · Java

Large-scale academic publishing with global distribution, multi-level approval workflows, and order processing at millions of records. Resolved critical production incident under high load.

JavaSQLDB2OracleWeb Technologies

BI & Analytics Dashboard

Code

Data / BI

End-to-end data pipelines and Power BI dashboards replacing manual weekly executive reports — multi-source ingestion and Python/SQL transformations.

Power BIPythonSQLApache SparkAWS Redshift

Data Processing Framework

Code

Data Engineering

Reusable enterprise framework — millions of records/second with pluggable ingestion, transformation, and output pipelines. Adopted across multiple internal projects.

PythonApache SparkETLDistributed SystemsSQL

Data API Platform for Analytics

⌥ Code

Backend · APIs · Python

Designed and built a scalable FastAPI backend platform delivering analytics datasets to internal applications and dashboards. Integrated with a centralized data warehouse exposing secure REST APIs for high-volume data retrieval.

API authentication, query endpoints, and performance optimization for high-volume requests
Architecture: Data Warehouse → FastAPI Backend → REST APIs → Applications & Dashboards

PythonFastAPIREST APIPostgreSQLBackend Development

Modern Data Pipeline Architecture

⌥ Code

Data Engineering · Snowflake · Power BI

Designed and implemented a scalable end-to-end data pipeline collecting data from external APIs, processing large datasets with Apache Spark, and loading into Snowflake for Power BI analytics dashboards.

Apache Airflow orchestrates pipeline workflows; Spark processes large-scale datasets
Architecture: API Sources → Airflow → Spark → Snowflake → Power BI

Apache SparkApache AirflowSnowflakePower BIPython

Real-Time Streaming Data Platform

⌥ Code

Data Engineering · Kafka · Streaming

Built a real-time data streaming platform processing high-volume event data continuously using Apache Kafka and Spark Structured Streaming, storing results in Databricks Delta Lake for analytics and monitoring.

Continuous event processing pipeline handling high-throughput real-time data streams
Architecture: Event Producers → Kafka → Spark Streaming → Databricks Delta Lake

Apache KafkaSpark StreamingDatabricksPythonDelta Lake

Open Source

GitHub Repositories

14 public repositories across AI engineering, full stack, backend APIs, Java, .NET, and data engineering. Every project ships real, working code.

📄DocMind AI

Real-Time GenAI Document Q&A Platform using LangChain, RAG, and LLMs for intelligent document understanding.

PythonLangChainRAGGenAI

🤖ai-pdf-chatbot-langchain-rag

AI PDF chatbot using LangChain and RAG architecture for intelligent document querying with OpenAI.

PythonLangChainRAGOpenAI

🧠AI-retrieval-agent-starter

Starter kit for building autonomous AI retrieval agents with tool-calling, planning, and reasoning.

PythonLangChainAgentsFastAPI

⚡SignalOps

Real-Time Streaming Data Platform — Apache Kafka, Spark Structured Streaming, Databricks Delta Lake.

KafkaSparkDatabricksPython

🔀StreamForge

End-to-End Data Engineering Platform — Airflow orchestration, Spark processing, Snowflake warehouse.

AirflowSparkSnowflakePython

🗄WarehouseForge

Enterprise data warehouse architecture with ETL pipelines, Power BI dashboards, and analytics reporting.

SnowflakeETLPower BISQL

🔷ShopSphere

Modern ASP.NET Core Full Stack Reference Application — .NET, C#, REST APIs, full stack architecture.

.NETC#ASP.NET CoreFull Stack

☕CareTrack

Full Stack Java Application with Spring Boot — healthcare domain, REST APIs, database integration.

JavaSpring BootREST APIMySQL

⚛fastapi-react-admin-boilerplate

Full stack boilerplate — FastAPI Python backend with React admin dashboard, ready to deploy.

FastAPIReactPythonPostgreSQL

🔐fastapi-postgres-auth-backend

Production-ready FastAPI + PostgreSQL authentication backend with JWT tokens and role-based access.

FastAPIPythonPostgreSQLJWT

🛍nextjs-shopify-storefront

Modern Next.js Shopify storefront — full stack e-commerce with TypeScript and Shopify Storefront API.

Next.jsTypeScriptShopifyReact

📝fastapi-blog-api-realworld

RealWorld spec blog API built with FastAPI — production-grade REST API with full CRUD and auth.

FastAPIPythonREST APIPostgreSQL

🏥springboot-clinic-management

Clinic management system built with Spring Boot — Java backend, appointment scheduling, REST APIs.

JavaSpring BootMySQLREST API

📋medium-clone-api-spec

Medium clone API specification — REST API design with full CRUD, auth, and social features.

REST APIAPI DesignBackend

View all repositories → github.com/rajakotid007

Research & IP

Publications & Patent

Granted Patent US Patent

Sensor Analytics & Monitoring Device — Intelligent Data Acquisition System

Named contributor · worked with patent attorney through full USPTO filing & prosecution

Patent details withheld per confidentiality agreement — available upon request

Granted & Published

Published Paper IEEE

Sensor Monitoring Systems & Intelligent Data Acquisition

Rajakoti Dasari, Prabhat Jain, Subhas Sarkar

Full citation available upon request

Peer-reviewed

Published Paper US Technical Magazine

ML-Driven Predictive Monitoring for IoT Sensor Networks

Rajakoti Dasari, Prabhat Jain, Subhas Sarkar

Full citation available upon request

Peer-reviewed

Blog

Writing

Thoughts on software engineering, AI systems, and building things that last.

🔍

📅 March 2025⏱ 6 min read

Why RAG is Still the Right Approach for Enterprise AI in 2025

Everyone is racing to fine-tune their own models. But after building RAG systems serving 50K+ queries a month in production, retrieval-augmented generation remains the most practical path for enterprise AI.

AI EngineeringRAGLLMs

Get in touch

Let's connect

Feel free to reach out — whether it's about a role, a collaboration, or just to connect.

Email

[email protected]

Location

Austin, TX — US Citizen

Phone

+1 (323) 451-1479

Name

Email

Subject

Message

Who I am

Technologies I know well

Things I've built

GitHub Repositories

Publications & Patent

Writing

The fine-tuning trap

What RAG actually solves

The retrieval quality problem

Lessons from production

The demo-to-production gap is enormous

The three things that actually matter

1. Constrain the action space ruthlessly

2. Build human-in-the-loop checkpoints

3. Observability is not optional

The reliability framework that worked for us

Get in touch