Open to Senior AI Architect & GenAI Solution Architecture roles · Remote-first · International

Abhishek
Kumar.

Solution Architect

Ten years turning ambiguous business and regulatory requirements into governed, production-ready AI. I own solution architecture for enterprise compliance and document-intelligence systems — defining classification taxonomies, retrieval architectures, and LLM integration patterns, and naming the trade-offs that make them scale, survive an audit, and ship.

See what I’m building Get in touch

Abhishek Kumar sitting on stone steps in a white shirt and sunglasses, string lights and a tiled-roof building behind him. Black and white. — Bangalore-based. Internet-based.
Currently mid-thought.

~/abhishek — zsh

Now · what’s on the desk

A short, dated list of what’s top of mind.

Last updated: May 2026 · in the spirit of nownownow.com

JobPilot — a personal job-hunt copilot for the v0 global hackathon. Next.js + Neon Postgres + Apify on Vercel. Shipping the v0 submission this week.building
Porting the Superpowers plugin to Kiro IDE on Windows — pairing on Windows is a love story I’m still negotiating.porting
Publishing Claude Agent Skills: v0-prompting, humanizer, AI Ark, 360Brew profile optimizer, and a LinkedIn content one I keep tweaking at 2am.shipping
Quietly exploring international remote roles and Senior AI Architect positions at AI-native companies. (Yes, this is a hint.)listening
Reading: PageIndex paper, agentic eval harnesses, anything Anthropic’s research org puts on arxiv at 11pm IST.reading

What I do

Four outcomes, not a wall of skills.

I think people hire architects for outcomes, not for buzzword bingo. So here’s the work, grouped by what it actually decides.

01 / architecture

GenAI Solution Architecture

Translating ambiguous business and regulatory requirements into reference architectures. Technology selection, integration patterns, schema and API design, scalability, and production-readiness.

Reference architecture Tech selection Integration design Production-readiness

02 / retrieval

Retrieval Architecture & Trade-offs

Designing and formally evaluating retrieval strategies — RAG, PageIndex, hybrid, reranking — with evaluation harnesses that make the approach-selection call defensible, not vibes-based.

RAG PageIndex Hybrid retrieval Eval harnesses RAGAS

03 / governance

Governance & Compliance by Design

Audit-ready systems with classification taxonomies, PII redaction, model risk assessment, and explainability — aligned to GDPR, EU AI Act, and enterprise security KPIs.

GDPR EU AI Act PII redaction Audit-readiness Explainability

04 / multi-cloud

Multi-cloud & MLOps Architecture

Reference implementations that transfer cleanly across AWS Bedrock, Azure AI, and GCP. Model lifecycle, drift detection, CI/CD, capacity planning, monitoring.

AWS Bedrock Azure AI GCP MLOps Drift detection

— how I think

Architecture is the art of naming what you gave up.

Four principles I bring to every system.

Trade-offs over trends.

Every choice sacrifices something. I name what I gave up and why — that’s the line between picking a tool and architecting a system.

Governed by default.

Compliance, auditability, and explainability are designed in from day one — not bolted on the week before the audit.

Evaluation before production.

Formal quality gates and evaluation harnesses gate every prompt and retrieval change. Nothing ships on “looks good.”

Portable by design.

Patterns that transfer cleanly across Bedrock, Azure, and GCP — so a cloud choice is never a one-way door.

Patterns proven across telecom, healthcare, pharmaceuticals, and retail.

Selected work

Six things I’ve shipped, roughly in order of recency.

Pulled from the resume. Each one has a working artifact behind it — happy to walk through eval results, retrieval configs, or the bits that almost shipped sideways.

2026
now

Compliance RAG Pipeline

Production

“Claude reads 300+ pages of regulatory text and cites its sources.”

End-to-end production RAG for a Tier-1 enterprise engagement: Claude Sonnet on Bedrock, Qdrant for hybrid retrieval, FastAPI orchestration, RAGAS gates — the decision framework other compliance workflows now follow.

Retrieval evaluation architecture pattern — RAG vs PageIndex comparison flow with evaluation gates

Claude SonnetBedrockQdrantRAGASPageIndexFastAPI

2025

Enterprise Asset Classification at Scale

Production

“Hundreds of asset classes, structured outputs, a multi-week process killed.”

LLM classifier on Claude/Bedrock for a regulated enterprise client: structured output validation, confidence scoring, rationale generation, and a human-in-the-loop review queue for audit defensibility. Replaced a multi-week manual categorization with near-real-time.

Compliance classification architecture pattern — structured-output validation with human-in-the-loop review queue

ClaudeBedrockstructured outputsHITLgovernance

2025

JobPilot

In flight

“A job-hunt copilot that actually reads the listings.”

Personal project, v0 global hackathon submission. Next.js + Neon Postgres + Apify scrapers on Vercel, with retrieval over my own work history and a small evaluator for fit. Currently being dogfooded on my own search.

Next.jsNeonApifyv0personal

2024

Claude Interview Platform

🏆 Hackathon Winner

“$10K and a year of validation.”

Winner of the AWS Global GenAI Hackathon. Claude on Bedrock with dynamic, context-aware question generation and RAG-grounded transcript evaluation against role competencies. Prompt orchestration, structured-output scoring, quality validation across both generated and evaluated content.

ClaudeBedrockRAGeval-LLM$10K prize

2024

PharmaGraph

Production

“A knowledge graph that explains itself to a pharmacist.”

Neo4j knowledge graph with LLM-based entity linking and a RAG retrieval interface for complex pharmacological queries, built for a large US healthcare deployment. Every response grounded to a source document with inline citation and auditor-visible provenance. Azure OpenAI + Azure AI Search.

Neo4jAzure OpenAIAI Searchgrounded RAGprovenance

2025

GTMind

Personal

“Multi-tenant B2B prospecting, wrapped in MCP.”

Multi-tenant B2B prospecting SaaS exposed as an MCP server. FastAPI + Anthropic MCP SDK. The clients pull leads through Claude — the agent does the boring part, the human does the call.

FastAPIMCPAnthropic SDKmulti-tenant

Side quests

Six was an arbitrary number — here’s the rest of the rotation.

More builds, less depth. Same person.

#07 Production

Document intelligence layer

The ingestion layer downstream compliance and retrieval workflows stand on.

LLM-light document-to-structured-content service. Powers downstream compliance, classification, and retrieval workflows for a Tier-1 enterprise. The unglamorous infrastructure that everything else stands on.

FastAPIPythonOCRStructured extractionVector store

#08 Production

Multimodal document processing pipeline

Compliance review across docs, slides, spreadsheets, and images.

Extended a text-only compliance pipeline to handle PPTX, XLSX, embedded images, and OLE objects using Claude Vision. The pipeline now reads anything an auditor would actually open.

Multimodal document intelligence architecture pattern — multi-format ingestion via vision-capable LLM

Claude VisionFastAPIAPI gatewayPython

#09 R&D

LinkedIn Connection Intelligence

Multi-agent RAG over personal LinkedIn graph data.

Ingests a LinkedIn connection graph, enriches it, and surfaces second-order paths and warm introductions. Pitched to the Vercel AI Accelerator. The “LinkedIn GPT for individuals” idea.

LangGraphQdrantNeo4jPostgresNext.js

#10 Built

Voice Reservation Agent

Conversational restaurant booking over the phone.

Takes restaurant reservation calls end-to-end — handles availability, party size, dietary notes, and confirmation, then writes to the booking system. The first time hearing it pick up was uncanny.

n8nElevenLabsTwilioOpenAI

#11 Built

ICP Qualifier Agent

Lead qualification agent on a partner agent platform.

Qualifies inbound leads against an ideal customer profile — pulls firmographic and intent signals, scores, routes. Built on the Nasiko A2A platform as part of their agent ecosystem.

NasikoPythonAnthropic SDK

#12 Personal

OpenTravel

Travel chatbot grounded in YouTube creator transcripts.

Niche RAG that treats travel YouTube creators as the knowledge base. Extracts transcripts, builds a hybrid retriever, answers travel questions with cited timestamps. The internet’s actual travel guide, finally queryable.

YouTube APIOpenAIRerankingEmbeddings

#13 Personal

HydroLedger

Car wash management system, no-code first.

End-to-end ops for a small car wash — bookings, customer ledger, daily reports. Built to test how far Lovable + Supabase can take you before you reach for code. Further than expected.

LovableSupabaseReact

Professional journey

A decade, four chapters, one consistent through-line.

Started in analytics, ended up in production GenAI. The titles shifted; the work — turning messy real-world data into systems people can trust — didn’t.

2015 → 2020 Analytics & Data Science (multiple roles)

Gambit Sports · Rooster Properties · Jupiter Infrastructure

Started in sports analytics and operational analytics; built predictive models, ETL pipelines, and early ML systems across sports, real estate, and infrastructure verticals.

2021 → 2023 Lead Business Analyst

Practo Technologies · healthcare consumer tech

Marketing Mix Modeling, real-time campaign optimization, user segmentation. First serious production ML work — the kind that has SLAs.

2023 → 2025 Manager, Data Science & Analytics

Factspan Analytics

Transitioned into GenAI leadership. Built production RAG, knowledge graphs, and MLOps infrastructure. Won the AWS Global GenAI Hackathon during this period.

2025 → now Digital Engineering Staff Engineer, AI & GenAI

NTT DATA

Leading enterprise RAG delivery, LLM evaluation frameworks, and AI governance for Tier-1 clients. Architecting parallel cloud-native reference implementations across AWS Bedrock and Azure OpenAI.

Community & speaking

7,000 practitioners, 40+ events, one chai habit.

I lead the Bangalore chapter of The AI Collective — the largest practitioner community in India for production AI engineers. Since March 2025, we’ve shipped 40+ events, hosted speakers from across the ecosystem, and built the kind of room where people argue about eval harnesses and then go get dinner.

Apr 2026 · Bangalore

Claude Code Meetup

200 attendees · speaker + organizer

Mar 2026 · Bangalore

Vercel v0 Buildathon

Buildathon co-lead

Feb 2026 · Microsoft

GenAI Demo Day 2.0

Hosted at Microsoft Bangalore

Jan 2026 · Bangalore

Genspark Event

80 attendees · talk + Q&A

2025 — ongoing

Andela — AI for DevOps

Cohort facilitator (paid)

Since Mar 2025

AI Collective — Bangalore

Chapter lead · 7,000+ members

In partnership with

Microsoft Anthropic Vercel Genspark Andela

Writing & open source

Mostly Claude Agent Skills and what broke.

I publish small, sharp agent skills — opinionated prompts and tools designed to make Claude better at one specific thing. A few are below; the rest are in drafts.

Published agent skills

v0-prompting humanizer / de-AIs your copy AI Ark OpenClaw 360Brew — profile optimizer LinkedIn content

Soon · essay “What I got wrong about RAG eval (twice).”

Soon · teardown “PageIndex vs chunked RAG: an honest benchmark.”

Soon · field notes “Running a 5K-person AI community on weekends.”

Credentials

The short version of the long PDF.

Download resume (PDF)

M.S. — Machine Learning & AI Liverpool John Moores University, UK

2019 → 2021

PG Diploma — ML & AI IIIT Bangalore

2019 → 2020

B.Tech — Engineering Manipal Institute of Technology

2012 → 2015

🏆 AWS Global GenAI Hackathon Winner GenAI Academy · Green Belt Azure AI Fundamentals Azure Generative AI GCP Intro to GenAI deeplearning.ai · DL Specialization CSPO® · Scrum Alliance

Working on something interesting? Send a note.

I read everything. Best for: AI / GenAI Solution Architect conversations at AI-native companies, production-RAG and governance consulting, community collabs, or just a good thread about retrieval trade-offs. Slow for: cold sales pitches.

Email abhishek.lagran@gmail.com LinkedIn abhishek-kumar-aiml X / Twitter @__iamabhishek WhatsApp +91 83103 38891····· ·····

Abhishek Kumar.

A short, dated list of what’s top of mind.

Four outcomes, not a wall of skills.

GenAI Solution Architecture

Retrieval Architecture & Trade-offs

Governance & Compliance by Design

Multi-cloud & MLOps Architecture

Architecture is the art of naming what you gave up.

Trade-offs over trends.

Governed by default.

Evaluation before production.

Portable by design.

Six things I’ve shipped, roughly in order of recency.

Compliance RAG Pipeline

Enterprise Asset Classification at Scale

JobPilot

Claude Interview Platform

PharmaGraph

GTMind

Six was an arbitrary number — here’s the rest of the rotation.

Document intelligence layer

Multimodal document processing pipeline

LinkedIn Connection Intelligence

Voice Reservation Agent

ICP Qualifier Agent

OpenTravel

HydroLedger

A decade, four chapters, one consistent through-line.

7,000 practitioners, 40+ events, one chai habit.

Mostly Claude Agent Skills and what broke.

The short version of the long PDF.

Working on something interesting? Send a note.

Tweaks ×

Abhishek
Kumar.