All cases

Gen AI sales training platform with a RAG architecture

Client
A technology company developing a scalable solution to accelerate sales team onboarding and reduce ramp-up time
Industry
Technology, corporate education
Services
Gen AI consulting, Gen AI development, RAG
Tech stack
OpenAI GPT-4, Mistral 7B, OpenAI Embeddings, SentenceTransformers, custom RAG pipeline, Python, custom Python libraries for file parsing, Java, Microsoft Azure (Azure Service Bus, SQL Server, and Azure Blob Storage)

Challenge

Traditional sales onboarding is a notoriously slow and resource-intensive process. According to industry data, it can take three to six months for a new representative to become fully productive, and research from institutions such as DePaul University estimates that the total cost of hiring, training, and ramping up a single sales rep can exceed $100,000. Our client, a SaaS company based in the United States, wanted to use generative artificial intelligence to solve this expensive industry-wide problem. While they had extensive experience in corporate training, they required a specialized engineering partner to handle the intricacies of generative AI development. That is why they hired ITRex to design and develop their platform's core Gen AI functionality.

Solution

As the core engineering partner, ITRex developed a generative AI training platform that personalizes the learning process at scale. Our approach was based on a modular, purpose-driven language model (LLM) architecture, ensuring that each component of the training experience was precise, scalable, and efficient. ITRex developed the solution using a sophisticated retrieval-augmented generation (RAG) pipeline to generate a high-fidelity knowledge base. We devised a multi-stage procedure to avoid the common pitfalls of LLM hallucination and content repetition:
Advanced data processing. The ITRex R&D team created custom parsers for a variety of source documents (PDFs, PPTX, DOCX, audio/video subtitles) to normalize all incoming data into structured text format.
Intelligent chunking & embedding. To improve semantic segmentation, we implemented an adaptive chunk splitter that used positional encoding. This ensured that the context fed into the model was always relevant. The processed chunks were then transformed into vector embeddings via the OpenAI Embeddings model and domain-specific SentenceTransformers.
Few-shot learning for factual consistency. To further ground the model's output in reality, we enhanced our advanced RAG pipeline with few-shot learning. By providing the model with curated, high-quality question-and-answer pairs directly within the prompt, we guided its responses to be more factually consistent and less repetitive, achieving high accuracy without retraining the model.
With a solid knowledge base in place, ITRex used specialized Gen AI components to develop the platform's core features, such as automated lesson generation, dynamic personalization based on resume and role analysis, and a real-time interactive Q&A module. A project of this magnitude involved overcoming several engineering hurdles. Among the primary challenges and proposed solutions were:
Reducing hallucinations and repetitions during lesson generation. Early LLM outputs were frequently repetitive or contained hallucinated content that did not match the source documents. To tackle the problem, our R&D engineers enhanced the RAG pipeline with an adaptive chunk splitter and introduced retrieval-based filtering layers. Combining this technique with few-shot learning on curated examples resulted in a strong, multi-layered defense against inaccurate content generation.
Personalizing educational content across different organizations. We discovered that seniority definitions, such as "junior" or "senior," differed greatly between clients, making resume-only personalization inconsistent. We addressed the issue by configuring the system to perform dual analysis. The platform compares parsed CV data to role requirements extracted from the client's own company documents, dynamically adjusting lesson complexity and ensuring relevance.
Achieving real-time performance and low latency. The LLM powering the platform was initially hosted in the cloud (Azure) and accessed via APIs. This introduced a lag during real-time Q&A sessions. We solved the puzzle by benchmarking providers and, after careful consideration, migrating the LLM-related services to a direct OpenAI API endpoint. The switch was handled with minimal disruption thanks to the platform's modular architecture, significantly improving response time and user experience.
Gen AI Sales Training Platform with a RAG Architecture
Gen AI Sales Training Platform

Impact

The Gen AI engine developed by ITRex became the foundation of the client's platform, allowing them to offer a SaaS product that delivers transformative results for their customers. Companies that utilize the platform can now:
Reduce sales onboarding time by up to 92%. Early studies conducted by the client suggest that the platform reduces the training lifecycle for new sales representatives from an industry average of six months down to just two weeks.
Automate course creation. Our fine-tuned LLM can process an entire library of internal documents and generate a complete, tailored training course in 4-5 hours, a task that used to take months of manual effort.
Increase senior staff productivity. The Gen AI platform takes away the burden of time-consuming training and assessment tasks from senior sales managers, allowing them to focus on high-value activities such as strategy and deal closing.

Latest projects