IBM Granite

Granite
Developer(s)IBM Research[1]
Initial releaseNovember 7, 2023; 12 months ago (2023-11-07)
PlatformIBM Watsonx (initially)
GitHub
Hugging Face
RHEL AI
Type
LicenseProprietary
Code models: Open Source (Apache 2.0)[2]

IBM Granite is a series of decoder-only AI foundation models created by IBM. It was announced on September 7, 2023,[3][4] and an initial paper was published 4 days later.[5] Initially intended for use in the IBM's cloud-based data and generative AI platform Watsonx along with other models,[6] IBM opened the source code of some code models.[7] Granite models are trained on datasets curated from Internet, academic publishings, code datasets, legal and finance documents.[8][9][1]

  1. ^ a b McDowell, Steve. "IBM's New Granite Foundation Models Enable Safe Enterprise AI". Forbes.
  2. ^ ibm-granite/granite-code-models, IBM Granite, 2024-05-08, retrieved 2024-05-08
  3. ^ Nirmal, Dinesh (September 7, 2023). "Building AI for business: IBM's Granite foundation models". IBM.
  4. ^ "IBM debuts Granite series of hardware-efficient language models". September 7, 2023.
  5. ^ "Granite Foundation Models" (PDF). IBM. 2023-11-30.
  6. ^ Fritts, Harold (2024-04-22). "IBM Adds Meta Llama 3 To watsonx, Expands AI Offerings". StorageReview.com. Retrieved 2024-05-08.
  7. ^ Jindal, Siddharth (2024-05-07). "IBM Releases Open-Source Granite Code Models, Outperforms Llama 3". Analytics India Magazine. Retrieved 2024-05-08.
  8. ^ Azhar, Ali (2024-04-08). "IBM Patents a Faster Method to Train LLMs for Enterprises". Datanami. Retrieved 2024-05-08.
  9. ^ Wiggers, Kyle (2023-09-07). "IBM rolls out new generative AI features and models". TechCrunch. Retrieved 2024-05-08.