FLAGSHIP WEEK

WORLD TOUR

VIEW TALKS

Talk

On-demand

Virtual

Operating production RAG at platform scale under continuous change

Production RAG is a platform reliability problem. This talk shows how to operate version-aware retrieval across 121K+ enterprise docs with scalable data generation, regression-safe evaluation, and sub-200 ms latency under continuous corpus change.

Register

mins

Meet the speakers

Varun Kumar Kotte

Machine Learning Engineer, Adobe

Varun Kumar Kotte

Machine Learning Engineer, Adobe

Enterprise RAG systems face continuous documentation updates, strict SLAs, and version-sensitive correctness, making them infrastructure challenges, not demos. This talk presents a production architecture deployed for Adobe’s AI Assistant, operating over 121K+ enterprise documents with version-aware retrieval. The speaker will cover hybrid sparse+dense retrieval achieving 72.8% nDCG@4 with sub-200 ms P95 latency, scalable generation of 700K+ closed-domain QA pairs, and regression-safe evaluation pipelines combining automated and human review to prevent retrieval and answer-quality failures in production.

Operating production RAG at platform scale under continuous change

Register

Meet the speakers

Varun Kumar Kotte

Machine Learning Engineer, Adobe

Varun Kumar Kotte

Machine Learning Engineer, Adobe

Virtual

Register for PlatformCon 2026

Submit

Navigation

Live Day Paris

Live Day São Paulo

Live Day Sydney

Live Day SF & Valley

Past years

PlatformCon 2025

PlatformCon 2024

PlatformCon 2023

PlatformCon 2022

Join us

Youtube

LinkedIn

Platform Weekly

All rights reserved.

Powered by

x

Navigation

Live Day Paris

Live Day São Paulo

Live Day Sydney

Live Day SF & Valley

Past years

PlatformCon 2025

PlatformCon 2024

PlatformCon 2023

PlatformCon 2022

Join us

Youtube

LinkedIn

Platform Weekly

All rights reserved.

Powered by

x