Well, like everything in life, it depends. We offer a customer support automation platform and spend a large amount of our time and resources in evalu

So, which LLMs are the best for building a AI chatbot?

submited by
Style Pass
2024-04-03 05:30:04

Well, like everything in life, it depends. We offer a customer support automation platform and spend a large amount of our time and resources in evaluating, benchmarking, and deploying the most optimal Generative AI models for our customers. In this article, we share our learnings and takeaways in evaluating the popular Large Language Models (LLMs), particularly in the domain of customer support automation: LLama 2, Mistral, GPT-4, and GPT-3.5. We also evaluated the performance of these models across different providers such as OpenAI, Azure, and other emerging provider platforms.

While there are several benchmarks and results available online when it comes to the out-of-the-box performance of popular LLMs, we wanted to evaluate specifically for the customer support domain. We prioritized certain dimensions: Accuracy, Speed, Proprietary, Cost, Lack of Hallucinations, and Instruction Following.

Accuracy and correctness of responses are the most impactful elements for us and our customers as we are not just building demo chatbots but actual products that are deployed in production. We value our customers' brand and trust highly and ensure that only accurate answers without hallucinations are generated.

Leave a Comment