Portfolio · 2026

    Harmanpreet
    Singh.

    Data Scientist & AI Engineer.San Francisco Bay Area · currently at INVIDI.

    Scroll

    About

    How I got here.

    Harmanpreet Singh portrait

    I've always enjoyed building things, from scribbling logic in notebooks to shipping code that powers real products. Somewhere along the way, I got curious about the patterns behind the data. That curiosity led me into machine learning, and eventually into designing systems that don't just run, but learn.

    These days, I work at the intersection of software engineering and intelligence, building backend services, search systems, and ML pipelines. Lately I've been deep into semantic search and autonomous agents, and anything that turns raw data into real insight.

    I'm based in the San Francisco Bay Area. Outside of work, you'll usually find me hacking on side ideas or chasing good light with my camera. If you're curious or want to collaborate, reach out.

    Skills

    What I work on.

    01

    Agentic systems.

    Designing agents that use tools, hold memory, and act in loops without falling over past the demo.

    02

    Retrieval & semantic search.

    Embedding pipelines, hybrid search, and the eval loops that keep retrieval honest.

    03

    Production ML.

    Taking models from notebook to deployed APIs with infra that doesn't rot under traffic.

    Tools I reach for
    PythonTypeScriptRustJavaSQLJavaScript
    PyTorchHuggingFacescikit-learnTensorFlowWhisperNumPyPandas
    LangGraphLangflowMCPn8nLangChainOllama
    QdrantOpenSearchpgvectorPostgreSQLFirestoreSupabaseMongoDBRedis
    AWS SageMakerBedrockLambdaAPI GatewayGCPCloud RunTerraformDockerKubernetes
    FastAPITauriStreamlitReactGitGitHub ActionsCI/CDLinuxClaude Code

    Experience

    Where I've worked.

    A chronicle of the systems, teams, and problems I've shipped against.

    2024
    June 2024 — Present

    INVIDI Technologies

    Princeton, NJ

    Data Science & ML

    Leading data science initiatives focused on forecasting, optimization, and MLOps. Built MLOps pipeline using AWS SageMaker and API Gateway to deploy ML models as scalable, serverless endpoints. Currently building AI-powered semantic search systems for video content discovery and matching.

    PythonMachine LearningSemantic SearchComputer VisionAWSMLOpsTerraform
    2023
    May 2023 — May 2024

    INVIDI Technologies

    Princeton, NJ

    Software Developer Intern

    Developed advanced inventory scheduling system for efficient ad campaign delivery. Achieved 98% utilization rate of ad inventory.

    PythonAWS RedshiftData AnalysisScheduling AlgorithmsSQL
    2022
    August 2021 — August 2022

    Visa Inc.

    Bengaluru, India

    Senior Software Engineer

    Developed enterprise applications and ML-powered features for Visa's internal platforms. Improved retention and user experience by 40%.

    JavaJavaScriptPythonReact.jsMachine LearningREST APIs
    2019
    November 2019 — July 2021

    Cognizant Netcentric

    Pune, India

    Backend Engineer

    Developed backend solutions for major international clients including Kia Motors and InterContinental Hotels Group.

    JavaJavaScriptAEMAdobe AnalyticsMachine LearningAngularAWS
    2017
    December 2017 — October 2019

    Publicis Sapient

    Gurgaon, India

    Associate Technology

    Developed web solutions and ML models for clients and internal projects. Created analytics for marketing campaigns and customer profiles.

    JavaJavaScriptPythonAEMDjangoAnalyticsMachine Learning

    Education

    Where I learned.

    2024
    2022 — 2024

    Rutgers University

    New Brunswick, NJ

    Masters in Computer Science

    Advanced studies focused on machine learning, AI, and data science. Research Assistant on a longitudinal study of local news in New Jersey.

    Machine LearningArtificial IntelligenceData ScienceNatural Language ProcessingDeep Learning
    2017
    2013 — 2017

    Dr. B.R. Ambedkar NIT Jalandhar

    Jalandhar, India

    B.Tech, Electronics & Communication

    Built a strong foundation in engineering principles, programming, and signal systems. Developed alcohol-detection-based accident prevention technology as final-year project.

    ElectronicsCommunication SystemsProgrammingDigital Signal ProcessingEngineering Mathematics

    Projects

    Selected work.

    Hackathons, side ideas, and a few things I built to learn how something actually works.

    1st · Hackathon
    Agentic AI

    Vygil.

    Autonomous AI agent platform for activity tracking — uses computer vision, LLMs, persistent memory, and adaptive real-time decision-making.

    LLMAgentic AIMCPComputer VisionFastAPI
    3rd · Hackathon
    Privacy-first AI · Desktop

    Notebrew.

    Privacy-first AI meeting assistant — local Whisper/Parakeet transcription on-device with no bots joining your calls, dual audio capture, GPU acceleration on Metal/CUDA/Vulkan, and your choice of AI provider for summaries. Tauri desktop app for macOS and Windows. Currently shipping as a SaaS.

    TauriRustNext.jsWhisperParakeetLocal AI
    Voice AI · B2B

    AI Interview Platform.

    End-to-end voice-AI recruiting platform. Multi-state voice interview agent over web and phone, with silent mid-call extraction that compresses conversation into structured signal before the closing summary. Hardened with a layered prompt-injection defense for untrusted candidate input. Backed by hybrid semantic search (vector + BM25), LLM rubric scoring with must-have/preference classification and tech-adjacency rules, LinkedIn enrichment, and a full pipeline CRM.

    Voice AIRetellPrompt Injection DefenseHybrid SearchQdrantFastAPINext.jsFirestore
    LLM · Browser

    InsightWing.

    Chrome extension using FalconLLM and LangChain for 60-word web content summarization, with an interactive chat layer.

    LangChainChrome ExtensionJS
    Microservices

    Video Recommendation.

    Containerized, microservice-based recommendation system on FastAPI + Redis + Docker + Kubernetes with autoscaling and caching.

    FastAPIRedisDockerK8s
    RAG

    Document QA.

    Retrieval-augmented document QA on LangChain + HuggingFace + FAISS with efficient chunking and reranking.

    LangChainRAGFAISS
    Generative · CV

    StyleGAN.

    StyleGAN from scratch on FFHQ with Few-Shot GDA via Domain Re-modulation (DoRM) for cross-domain adaptation.

    GANPyTorchCV
    Neuromorphic

    SNN · ASL.

    Spiking neural network for sign-language recognition on the ASL Dynamic Vision Sensor dataset — 96.7% test accuracy.

    SNNCVPython
    Data Analysis

    Suicide Trends.

    Global socioeconomic patterns in suicide trends — GDP correlation analysis with age/gender breakdowns in R.

    RStatsViz

    Contact

    Let's talk.

    Book a coffee chat or send a note — whichever's easier.

    Schedule15 — 30 min

    © 2026 · Harmanpreet Singh