Available for select product and platform conversations

Rahul Dhiman

Software Engineer | Backend, Search, AI & ML Systems

I design high-throughput backend systems, AI-powered support platforms, and ML-driven forecasting products that make complex problems feel fast, reliable, and commercially useful.

Software engineer with ~4 years of experience building enterprise search, low-latency APIs, production-grade LLM workflows, and machine learning systems for predictive analytics and revenue intelligence. My work sits at the intersection of distributed systems, information retrieval, applied AI, and product execution.

GitHub LinkedIn hi@rahuldhiman.comFounder-friendly, recruiter-friendly, execution-first

Current focus

Search, retrieval, and AI systems that hold up in production.

50K+

Production users

10M+

Documents indexed

91%

Forecast accuracy

65%

Latency reduction

Quick actions

Open the action palette to print, share, copy contact details, or send the resume without hunting through the page.

What I do

The resume story in one read.

A concise overview for hiring managers, founders, and teams that need both systems depth and product judgment.

I build the backend systems and machine learning pipelines that power search, support, and AI-driven customer experiences at scale. My strength is translating deep technical constraints — from low-latency APIs to predictive analytics models — into products that feel immediate and reliable for end users.

I am at my best on platform-heavy problems: search relevance, ingestion pipelines, API performance, time series forecasting, AI retrieval systems, fault tolerance, and agentic workflow automation that touches real business outcomes.

I am targeting senior software engineering and ML engineering roles where systems design, search, AI integration, predictive analytics, and execution quality all matter. The differentiator I bring is practical depth with a product mindset.

Scalable backend architecture & FastAPI services

Search relevance, indexing, and retrieval

Machine learning & time series forecasting

LLM systems with grounding, agentic orchestration, and LangGraph

Positioning

Technical depth with commercial intent.

The through-line across the work: systems that scale, interfaces that serve real teams, and AI that is measured by outcomes rather than novelty.

Enterprise search at scale

Shipped search and retrieval systems where performance, relevance, and uptime materially affect support efficiency.

AI that survives production

Focused on grounded LLM workflows with fallbacks, measurable outcomes, and operational confidence.

Predictive ML for revenue-critical decisions

Built time series forecasting and risk-scoring models that feed directly into pricing and overbooking decisions — where accuracy has a direct dollar impact.

Operator mindset

I care about latency, observability, failure modes, and maintainability as much as shipping speed.

Voice and multimodal systems

Built real-time voice pipelines that orchestrate speech, context, and response generation with tight latency budgets.

Experience

Built for scale, measured by impact.

A structured timeline of the roles, systems, and outcomes that define the current body of work.

Software Engineer

Grazitti Interactive (SearchUnify)

2023 - Present

Hybrid / Remote

Own backend and AI-heavy initiatives for an enterprise search platform used across customer support and knowledge workflows.

Designed and scaled enterprise search systems serving 50K+ users across large support environments.
Built indexing pipelines for 10M+ documents across CRM, knowledge base, and internal content systems.
Reduced p95 latency from 400ms to 140ms through query optimization, caching strategy, and API tuning.
Developed backend services in Node.js and Python handling 1K+ QPS under production workloads.
Led Salesforce Service Cloud integration that improved knowledge recommendations and cut ticket resolution time by ~25%.
Architected LLM-powered support automation using agentic workflows and RAG-based retrieval, reducing repetitive support queries by 30% and cutting average handle time by 25%.
Designed and deployed multi-agent AI orchestration (LangGraph) enabling complex, multi-step query resolution without human escalation.
Improved reliability with fallback logic, production monitoring, and alerting around search failure states.

Node.jsPythonFastAPISearchRAGLangGraphSalesforceAWS

Software Engineer

Grazitti Interactive

2022 - 2023

On-site / Hybrid

Worked on search infrastructure foundations, ingestion pipelines, and production debugging across distributed services.

Built REST APIs and ingestion pipelines for search-oriented backend systems.
Designed batch and streaming workflows for document ingestion across multiple data sources.
Reduced ingestion failures by ~40% with retry orchestration, validation, and stronger failure recovery.
Debugged production issues in distributed services and improved service resilience.

Outcome

Designed ranking, deduplication, and multi-source indexing for 10M+ documents across enterprise environments.

Enterprise SearchSearch RelevanceScale

See capabilities

Expertise

Organized by capability, not buzzwords.

A signal-dense breakdown of the domains I ship in most often.

Systems and backend design

High-signal execution on APIs, throughput, reliability, and operational safeguards.

Search quality and data pipelines

Deep comfort with indexing, retrieval, relevance, and scale-sensitive search experiences.

Applied AI and machine learning

RAG systems, agentic flows, time series forecasting, and pragmatic ML integration into real user workflows.

Backend & Distributed Systems

Node.jsPythonFastAPIREST API DesignMicroservicesAsync ProcessingLow-latency Services

Search & Retrieval

ElasticsearchSolrIndexing PipelinesRankingQuery OptimizationDeduplication

AI & LLM Systems

RAG PipelinesAgentic WorkflowsLangGraphLangChainPrompt EngineeringMulti-Agent OrchestrationFallback LogicEvaluation & Evals

Machine Learning & MLOps

Time Series ForecastingPredictive AnalyticsXGBoostProphetScikit-learnFeature EngineeringModel EvaluationDynamic Pricing ModelsRevenue ManagementDocker-based Model Serving

Data & Persistence

PostgreSQLRedisSQLData ModelingChunking StrategiesKnowledge SystemsContext RetrievalTime Series Data

Cloud & Delivery

AWS EC2S3RDSLambdaCI/CDDockerProduction Monitoring

Product & Operations

Service CloudSupport AutomationCross-functional DeliveryReliability Engineering

Metrics

Numbers that make the story easy to scan.

The signal most recruiters and engineering leaders want to find quickly: scope, load, performance, and measurable change.

Years building production systems

0K+

Enterprise users served

0M+

Documents indexed across systems

p95 latency reduction on key flows

Repetitive support queries reduced

Occupancy forecast accuracy (MAPE <9%)

No-show revenue leakage reduced via ML overbooking model

Simulated RevPAR improvement from dynamic pricing recommendations

Education

Compact, clear, and in service of the work.

Formal education is presented with restraint so the professional signal stays in focus.

Completed

Masters of Computer Application

Kurukshetra University

Contact

Built to convert curiosity into conversation.

If you are hiring for backend platforms, search, AI systems, or product infrastructure, this site should make the next step obvious.

hi@rahuldhiman.com

Direct response path for opportunities, portfolio requests, and architecture conversations.

India · IST (UTC+5:30)

Open to full-time, consulting, and high-impact platform work

GitHub LinkedIn Share profile Send resume