LlamaIndex

LlamaIndex Feb 27, 2024

LlamaIndex Newsletter 2024–02–27

Yo, LlamaIndex Fans 🦙,

Dive into a week brimming with thrilling developments at LlamaIndex! The dynamic input from our community and our rich selection of learning materials are all set to enhance your journey with LlamaIndex.

Last week, the LlamaIndex ecosystem took a significant leap forward with the launch of LlamaCloud, a suite of advanced services designed for production-level context enhancement in LLM and RAG applications:

  • LlamaParse: Offers sophisticated parsing for complex documents, making it possible to answer detailed queries.
  • Managed Ingestion and Retrieval API: Facilitates easier data management, connecting with over 150 sources and 40+ storage solutions.

LlamaParse is now available for a public preview, primarily focusing on PDFs with a user cap, while the API is in a private preview for select enterprise partners. If you haven’t explored these new features yet, we invite you to check them out for more details or to discuss commercial terms.

Your innovation inspires us! We’re eager to see the projects, articles, or videos that inspire you. Share your remarkable works with us at news@llamaindex.ai. And if you haven’t already, subscribe to our newsletter on our website to receive the latest LlamaIndex updates straight to your inbox.

🤩 The highlights:

  1. Enhanced RAG Retrieval with Sub-Document Summaries: Introducing a novel chunking method that improves RAG performance by incorporating hierarchical metadata into chunks, ensuring precise and context-aware information retrieval. Notebook, Tweet.
  2. MistralAI Cookbook: A comprehensive guide to leveraging the Mistral-Large model from MistralAI, featuring near-GPT-4 reasoning, function calling, and JSON output for cutting-edge applications. Docs, Tweet.
  3. Gemma Cookbook: A comprehensive guide to using Gemma, GoogleDeepMind’s latest LLM offering, with options for 2B and 7B parameters, facilitating the development of local RAG systems on your laptop. Notebook, Tweet.
  4. ColBERT Integration: Document reranking with ColBERT via LlamaIndex, delivering a solution that is about 100x faster than BERT-based models for more efficient data processing. Notebook, Tweet.
  5. Counselor Copilot — Social Impact Through RAG: Spotlight on Counselor Copilot, an innovative RAG project supporting the Trevor Project’s crisis counselors, providing real-time assistance with context, suggestions, and actions to aid LGBTQ+ youth effectively. BlogPost, Tweet.

✨ Feature Releases and Enhancements:

  • We have launched a new chunking strategy to enhance RAG retrieval: Sub-Document Summaries. This approach overcomes the limitations of naive chunking by injecting hierarchical metadata, offering a nuanced balance of global context awareness and precision through subdocument summaries for improved performance. Notebook, Tweet.
  • We have launched a cookbook for the latest mistral-large model from MistralAI offering advanced features like near GPT-4 level reasoning, Function calling, JSON Output, and more. Docs, Tweet.
  • We have launched a cookbook on Gemma, a new family of state-of-the-art LLMs by GoogleDeepMind, with 2B and 7B parameter options using Ollama to build local RAG on your laptop. Notebook, Tweet.
  • We have introduced ColBERT through LlamaIndex, offering a one-line integration for a reranking model that’s ~100x faster than traditional BERT-based models, ensuring efficient document handling with superior performance. Notebook, Tweet.
  • We have introduced a way to integrate advanced RAG into full-stack web apps with create-llama, using LlamaPacks, in just two lines of code. create-llama, Tweet.

🎥 Demos:

Counselor Copilot: An interesting RAG project by Riya Jagetia and team, designed to assist crisis counselors at the Trevor Project in supporting LGBTQ+ youth. This tool acts as a real-time copilot, offering context, suggested replies, and various actions to enhance counselor effectiveness, showcasing a unique and socially impactful application of advanced RAG techniques. BlogPost, Tweet.

🗺️ Guides:

  • Guide to simplifying advanced RAG development: Our latest insights pinpoint solutions for key challenges, including our innovative LlamaParse for complex PDF QA, shared in our AI in Production presentation.

✍️ Tutorials:

🎥 Webinar:

  • Webinar with Sisil from JasperAI on Practical Tips and Tricks for Productionizing RAG.