Small language model

Small language models (SLMs) are artificial intelligence language models designed for human natural language processing including language and text generation. Unlike large language models (LLMs), small language models are much smaller in scale and scope.

Typically, an LLM's number of training parameters is in the hundreds of billions, with some models even exceeding a trillion parameters. The size of any LLM is vast because it contains a large amount of information, which allows it to generate better content. However, this requires enormous computational power, making it impossible for an individual to train a large language model using just a single computer and GPU.

Small language models, on the other hand, use far fewer parameters, typically ranging from a few million to a few billion. This make them more feasible to train and host in resource-constrained environments such as a single computer or even a mobile device.{{cite web|url=https://www.ibm.com/think/topics/small-language-models|title= What are small language models? |author=Rina Diane Caballar|date= 31 October 2024 |publisher=IBM}}{{cite web|url=https://huggingface.co/blog/jjokah/small-language-model|title=Small Language Models (SLM): A Comprehensive Overview|author=John JOhnson|date=25 February 2025 |publisher=Huggingface}}{{cite web|url=https://www.weforum.org/stories/2025/01/ai-small-language-models/|title=What is a small language model and how can businesses leverage this AI tool?|author=Kate Whiting|publisher=The World Economic Forum}}{{cite web|url=https://learn.microsoft.com/en-us/shows/data-exposed/slm-small-language-model-with-your-data-data-exposed|title=SLM (Small Language Model) with your Data|date=11 July 2024 |publisher=Microsoft}}

See also

References