
Large language models (LLM) are advanced artificial intelligence (AI) systems specifically engineered to comprehend, interpret, and generate human-like text from a wide array of inputs. Leveraging state-of-the-art machine learning (ML) techniques, massive training datasets, and transformer architectures, these models can accomplish tasks ranging from translation, summarization, question answering, and conversation to more nuanced applications such as sentiment analysis, text classification, and creative content generation. LLMs are often integrated into existing applications and systems to automate language-heavy tasks such as powering conversational interfaces and supporting reasoning-driven insights.
LLMs differ from small language models (SLMs) primarily in scale, especially in parameter counts and the volume of training data used. LLMs typically have parameter sizes ranging from 10 billion to trillions of parameters, and SLMs have a few million to upwards of 10 billion parameter sizes. This category also differs from the AI chatbots software category, which focuses on standalone platforms that allow users to interact and engage with large language models, and the synthetic media software category, which consists of tools for business users to create AI-generated media. These LLM solutions, instead, are designed to be more versatile and foundational and can be integrated into a wide range of applications, not just limited to chatbots or synthetic media.
LLMs are typically either open-sourced or closed-sourced/proprietary. Open source models are freely downloadable and modifiable, with model weights and training codes being publicly available. Closed-source LLMs do not have source and model weights publicly downloadable, and are only available via API or endpoints. Additionally, some LLMs have reasoning capabilities, which help break down complex problems, apply logic, and follow thought processes to map out a solution. LLMs without reasoning capabilities, also known as base models, are focused on next-token predictions to predict patterns. Reasoning capabilities may be slower and more deliberate, whereas non-reasoning LLMs are faster.
To qualify for inclusion in the Large Language Models (LLM) category, a product must:
G2 takes pride in showing unbiased reviews on user satisfaction in our ratings and reports. We do not allow paid placements in any of our ratings, rankings, or reports. Learn about our scoring methodologies.
A weekly snapshot of rising stars, new launches, and what everyone's buzzing about.