AllenAI Models

allenai

allenai.org/language-models

OLMo

  • Open Language Model (OLMo) is a highly performant, truly open LLM and framework. It is intentionally designed to provide access to the data, training code, models, and evaluation code necessary to advance AI through open research, so that academics and researchers can study the science of language models collectively.

  • What OLMo provides for researchers and developers

    • More transparency. With full insight into the training data behind the model, researchers can work more efficiently and bypass the need to rely on qualitative assumptions of model performance.
    • Less carbon. By opening the full training and evaluation ecosystem, we can radically reduce developmental redundancies, which is critical in the decarbonization of AI.
    • Lasting impact. By keeping models and their datasets in the open rather than hidden behind APIs, we enable researchers to learn and build from previous models and work.
  • Dolma dataset, used for training OLMo

Tulu

https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618


AllenAI Multi-modal models

https://allenai.org/multimodal-models