LlamaIndex

LlamaIndex Nov 13, 2023

LlamaIndex turns 1!

It’s our birthday! One year ago, Jerry pushed his first commit to GPT Index, the project that would become LlamaIndex. It worked with GPT-3, the state-of-the-art model available at the time. That initial version was very simple, but the problem statement and the solution remain the same:

one fundamental limitation of GPT-3 is the context size […] the ability to feed “knowledge” to GPT-3 is mostly limited to this limited prompt size […] But what if GPT-3 can have access to potentially a much larger database of knowledge[…]?

Twelve months have passed and there’s been a tsunami of new developments in the world of generative AI and LLMs, but the reason LlamaIndex was invented remains: even the most sophisticated model isn’t trained on your data, which can be locked behind an API or in a SQL database, and even the latest GPT-4 Turbo context size of 128,000 tokens isn’t enough to hold even a relatively modest dataset. Retrieval-Augmented Generation (RAG) is here to stay.

Big numbers

At just 1 year old, LlamaIndex has gotten very big. How big? Here are some numbers:

Big thanks

But big numbers aside, the thing we’re proudest of is our community: we have users in (nearly) every country in the world, from solo hobbyist developers to Fortune 500 companies and everyone in between. LlamaIndex’s founder, Jerry Liu, says:

Our community is everything at LlamaIndex. We love seeing the amazing things people are building every day! It’s what gets us up in the morning and keeps us motivated to keep pushing the boundaries of what developers can do with GenAI. And we’re especially grateful to the developers who give back by pushing PRs, issues and bug reports. They’re what makes the open source world go round.

Big milestones

What’s happened in a year? Well, everything! But here are some highlights:

  • November 2022: Launched GPT Tree Index, a way of organizing information into a tree. Based on the initial interest/traction, we expanded this into a List Index and Keyword Index. Then ChatGPT launched in November.
  • December 2022: Some big feature releases: support for indexing embeddings + vector stores, and initial data loaders for Notion, Slack, and Google Drive
  • January 2023: LlamaIndex hits GitHub trending for the first time!
  • February 2023: We launched LlamaHub with Jesse Zhang, containing an initial repository of data loaders for users to access. We ran a sweepstakes with OctoML and got 50+ data loader submissions!
  • March 2023: The ChatGPT API launched, followed by Plugins. We scrambled to support the new API + Plugin integrations.
  • April 2023: We incorporated!
  • May 2023: At the end of April, we launched 0.6.0, where we rewrote the entire framework from the ground up for greater modularity and composability across different levels of abstraction.
  • June 2023: We announced that we raised $8.5M in funding!
  • July 2023: We launched Data Agents + Agent Tools on LlamaHub. We also launched a TypeScript package.
  • August 2023: We integrated with OpenAI fine-tuning and launched a variety of LLM and embedding fine-tuning abstractions.
  • September 2023: We launched secinsights.ai, a production-ready full-stack application.
  • October 2023: We launched LlamaIndex Chat, a full-stack TypeScript template.
  • November 2023: We went fully multi-modal with the release of GPT-4-vision!

Big plans

With all that growth and all those features, what’s next for us? Stay tuned!