TinyFish Accelerator · Pune, India

Kunal Raha

AI Engineer · Solo Builder · Backend Systems

I build AI-native systems, RAG architectures, and automation tools using modern backend + LLM stacks. From 0 to production in days.

3h→3m · Procurement time reduction
140+ · Distributors integrated
10+ · Beta users in production
Accelerator Experience

TinyFish Accelerator

Selected for the TinyFish AI Accelerator. Built and shipped OmniProcure from concept to real users during a high-pressure sprint week.

⚡

Sprint Execution

Built OmniProcure end-to-end during the sprint. API integration, AI layer, auth, and deployment all shipped under time pressure.

🔗

140+ Distributors

Integrated the OEM Secrets API to surface real-time supplier data across 140+ hardware distributors, including Mouser, DigiKey, and LCSC.

🤖

AI-Powered Part Resolution

Natural language queries like "RP2040" resolve to exact SKUs like SC0914 and match across suppliers automatically.
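
A minimal sketch of that resolve-then-match flow, in Python for illustration. The alias table is a hypothetical stand-in for the AI resolution step (only the "RP2040" to SC0914 pair comes from the text above), and `Offer`, `resolve_part`, and `match_offers` are assumed names, not OmniProcure's actual code.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical alias table standing in for the AI resolution step.
# Only the RP2040 -> SC0914 pair is taken from the page itself.
ALIASES = {"rp2040": "SC0914"}


@dataclass(frozen=True)
class Offer:
    """One supplier listing for a SKU (assumed shape, for illustration)."""
    supplier: str
    sku: str
    unit_price: float


def resolve_part(query: str) -> Optional[str]:
    """Map a free-text query to an exact SKU, or None if it is unknown."""
    return ALIASES.get(query.strip().lower())


def match_offers(query: str, offers: List[Offer]) -> List[Offer]:
    """Return every offer for the resolved SKU, cheapest first."""
    sku = resolve_part(query)
    if sku is None:
        return []
    matching = [offer for offer in offers if offer.sku == sku]
    return sorted(matching, key=lambda offer: offer.unit_price)
```

Calling `match_offers("RP2040", offers)` on a mixed list of supplier offers would return only the SC0914 listings, ordered by unit price.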

🧪

Real Beta Users

Shipped to 10+ hardware manufacturers (0 to 35 years of experience). Collected live feedback and iterated in real time.

3h→3m · Workflow Speed
140+ · Distributors
10+ · Beta Users
Phase 2 · Selected
Featured Work

Things I've Shipped

Real systems, real deployments, real users. Not tutorials.

Live Product

OmniProcure

AI-powered procurement automation that streamlines hardware sourcing. It uses AI to resolve part numbers (e.g. "RP2040" to SC0914), integrates 140+ distributors via the OEM Secrets API, and serves a card per supplier with a quantity selector. Guests can browse without signing in, while PO generation is gated behind OAuth. Cron-based async monitoring handles automated workflow execution.

↗ Reduced sourcing time from 3 hours to 3 minutes
↗ 140+ distributors integrated via OEM Secrets API
↗ 10+ beta users from hardware manufacturing (0 to 35 yrs experience)
↗ Cron-based async monitoring for automated workflow execution
↗ Guest flow with OAuth-gated PO generation
TypeScript · React · Supabase · AI APIs · Railway · OEM Secrets API
🛒
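
The guest flow with OAuth-gated PO generation can be sketched roughly as follows. This is an illustrative Python sketch, not the product's TypeScript code: `Session`, `LineItem`, `AuthRequired`, and `generate_po` are hypothetical names, and a session with no user ID is assumed to represent a guest.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Session:
    # None means a guest; a string means an OAuth-authenticated user (assumed model).
    user_id: Optional[str] = None


@dataclass
class LineItem:
    supplier: str
    sku: str
    quantity: int


class AuthRequired(Exception):
    """Raised when a guest attempts an action that needs sign-in."""


def generate_po(session: Session, items: List[LineItem]) -> Dict:
    """Build a purchase order grouped by supplier; guests are rejected."""
    if session.user_id is None:
        raise AuthRequired("Sign in to generate a purchase order")
    if not items:
        raise ValueError("A purchase order needs at least one line item")
    by_supplier: Dict[str, List[Dict]] = {}
    for item in items:
        # One PO section per supplier, mirroring the per-supplier cards.
        by_supplier.setdefault(item.supplier, []).append(
            {"sku": item.sku, "quantity": item.quantity}
        )
    return {"user_id": session.user_id, "suppliers": by_supplier}
```

Keeping the gate inside `generate_po` rather than in the UI means a guest can still search and fill quantities, and only the final step asks for sign-in.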
Open Source

Semantic Search Microservice

High-performance RAG-based semantic search engine optimized for CPU inference. Documents are embedded with BGE-M3 and stored in ChromaDB, and a FastAPI service returns ranked, context-aware results at ~1s latency on CPU.

↗ ~1s semantic retrieval latency on CPU
↗ BGE-M3 dense embedding model
↗ ChromaDB vector store with FastAPI ranked output
Python · FastAPI · BGE-M3 · ChromaDB · Docker
🔍
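
The ranked retrieval step can be illustrated with a small, dependency-free sketch. It is an assumption-laden stand-in: the toy bag-of-words `embed` replaces BGE-M3, an in-memory list replaces ChromaDB, and `search` is a hypothetical name. Only the embed, score, and rank pattern is the point.

```python
import math
from collections import Counter
from typing import List, Tuple


def embed(text: str) -> Counter:
    """Toy bag-of-words vector; a stand-in for a dense BGE-M3 embedding."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query: str, documents: List[str], top_k: int = 3) -> List[Tuple[str, float]]:
    """Score every document against the query and return the top_k, best first."""
    query_vec = embed(query)
    scored = [(doc, cosine(query_vec, embed(doc))) for doc in documents]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]
```

In a real deployment the vector store would handle the nearest-neighbour lookup instead of the linear scan shown here.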
Agent Infrastructure

Context Memory Worker

Local-first persistent memory for AI agents, enabling long-term contextual recall. Uses LanceDB and Ollama embeddings for semantic retrieval across sessions. Built as a node for the Aden Hive agent framework.

↗ Persistent memory across agent sessions
↗ Semantic retrieval via LanceDB vector store
↗ Dockerized for agent orchestration systems
Python · LanceDB · Ollama · Docker
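
The idea of memory that survives across agent sessions can be sketched with the standard library alone. Everything here is an assumption for illustration: a JSON file stands in for the LanceDB table, a toy word-count vector stands in for Ollama embeddings, and `MemoryStore` is a hypothetical name, not the worker's real interface.

```python
import json
import math
from collections import Counter
from pathlib import Path
from typing import List


def _embed(text: str) -> Counter:
    """Toy word-count vector; a stand-in for an Ollama embedding."""
    return Counter(text.lower().split())


def _cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class MemoryStore:
    """File-backed memory; a stand-in for a LanceDB table on local disk."""

    def __init__(self, path: str) -> None:
        self.path = Path(path)
        # Reload anything a previous session wrote to the same file.
        self.records: List[str] = (
            json.loads(self.path.read_text()) if self.path.exists() else []
        )

    def remember(self, text: str) -> None:
        """Append a memory and persist the full list immediately."""
        self.records.append(text)
        self.path.write_text(json.dumps(self.records))

    def recall(self, query: str, top_k: int = 1) -> List[str]:
        """Return the top_k stored memories most similar to the query."""
        query_vec = _embed(query)
        ranked = sorted(
            self.records,
            key=lambda text: _cosine(query_vec, _embed(text)),
            reverse=True,
        )
        return ranked[:top_k]
```

Creating a second `MemoryStore` on the same path simulates a new agent session: it starts with whatever the earlier session remembered.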
Tech Stack

How I Build

Full-stack AI systems from embedding pipelines to deployed products.

💬
Languages
TypeScript · Python
⚙️
Backend
FastAPI · Node.js · REST APIs
🎨
Frontend
React · Framer
🧠
AI Layer
RAG Pipelines · BGE-M3 · Claude API · Ollama · Agent Memory
🗄️
Databases
Supabase · ChromaDB · LanceDB
🚀
Infra and Tools
Docker · Railway · GitHub · PM2 · Nginx
Build Philosophy
01 API-first system design
02 Prototype fast with LLM + backend
03 Deploy early for real feedback
04 Iterate on user behavior
05 Optimize shipping speed over perfection
06 Sell what people need, not what I like
Get In Touch

Let's Build Something

Open to collaborations, contracts, and conversations.