Ravi Theja • Sep 21, 2023
LlamaIndex Update — 20/09/2023
Hello LlamaIndex Enthusiasts!
Welcome to the fifth edition of our LlamaIndex Update series.
Most Important Takeaways:
- We’ve open-sourced SECInsights.ai — a reference for building production-grade RAG applications.
- Replit templates — kickstart your projects with zero environment setup hassles.
- Build RAG from scratch with hands-on guides for each pipeline component.
But wait, there’s more!
- Feature Releases and Enhancements
- Fine-Tuning Guides
- Retrieval Tips for RAG
- Building RAG from Scratch Guides
- Tutorials
- Integrations with External Platforms
- Events
- Webinars
So, let’s embark on this journey together. Dive in and explore the offerings of the fifth edition of the LlamaIndex Update series!
Feature Releases and Enhancements
- Open-Sourced RAG Platform: LlamaIndex open-sourced http://secinsights.ai, accelerating RAG app development with chat-based Q&A features. Tweet
- Linear Adapter Fine-Tuning: LlamaIndex enables efficient fine-tuning of linear adapters on any embedding without re-embedding, enhancing retrieval/RAG across various models. Tweet, Docs, Blog Post
- Hierarchical Agents: By structuring LLM agents in a parent-child hierarchy, we enhance complex search and retrieval tasks across diverse data, offering more reliability than a standalone agent (a minimal sketch follows this list). Tweet
- SummaryIndex: We’ve renamed ListIndex to SummaryIndex to better reflect its main functionality. Backward compatibility is maintained for existing code using ListIndex. Tweet
- Evaluation: LlamaIndex’s new RAG evaluation toolkit offers async capabilities, diverse assessment criteria, and a centralized BaseEvaluator interface for easier developer integration (a sketch follows this list). Tweet, Docs.
- Hybrid Search for Postgres/pgvector: LlamaIndex introduces hybrid search for Postgres/pgvector, combining vector similarity with keyword text search (a sketch follows this list). Tweet, Docs.
- Replit Templates: LlamaIndex partners with Replit for easy LLM app templates, including ready-to-use Streamlit apps and full TypeScript templates. Tweet, Replit Templates.
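For the hierarchical agents item above, here is a minimal sketch of the parent-child pattern, assuming the llama_index Python API available around this release; the file paths and tool names are hypothetical.

```python
from llama_index import SimpleDirectoryReader, VectorStoreIndex
from llama_index.agent import OpenAIAgent
from llama_index.tools import QueryEngineTool, ToolMetadata

# Build one child agent per data source (file paths are hypothetical).
child_tools = []
for name in ["uber_2022", "lyft_2022"]:
    docs = SimpleDirectoryReader(input_files=[f"data/{name}.pdf"]).load_data()
    index = VectorStoreIndex.from_documents(docs)
    child_agent = OpenAIAgent.from_tools(
        [
            QueryEngineTool(
                query_engine=index.as_query_engine(),
                metadata=ToolMetadata(
                    name=f"{name}_qa",
                    description=f"Answers questions about the {name} filing.",
                ),
            )
        ]
    )
    # Agents expose a query interface, so each child agent can itself be
    # wrapped as a tool for the parent agent.
    child_tools.append(
        QueryEngineTool(
            query_engine=child_agent,
            metadata=ToolMetadata(
                name=f"{name}_agent",
                description=f"Agent that handles questions about {name}.",
            ),
        )
    )

# The parent agent routes each question to the right child agent.
top_agent = OpenAIAgent.from_tools(child_tools, verbose=True)
print(top_agent.chat("Compare the risk factors across the two filings."))
```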
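For the new evaluation toolkit, here is a sketch of the shared BaseEvaluator-style interface, assuming the llama_index evaluation module from around this release; the judge model, data folder, and query are placeholders.

```python
from llama_index import ServiceContext, SimpleDirectoryReader, VectorStoreIndex
from llama_index.evaluation import FaithfulnessEvaluator, RelevancyEvaluator
from llama_index.llms import OpenAI

# Use GPT-4 as the judge model (any supported LLM can be substituted).
service_context = ServiceContext.from_defaults(llm=OpenAI(model="gpt-4"))

# Build a simple query engine over a local folder (path is a placeholder).
index = VectorStoreIndex.from_documents(SimpleDirectoryReader("data").load_data())
query_engine = index.as_query_engine()

query = "What does the report say about revenue growth?"
response = query_engine.query(query)

# Both evaluators follow the shared BaseEvaluator interface; async variants
# are also available for batch evaluation.
faithfulness = FaithfulnessEvaluator(service_context=service_context)
relevancy = RelevancyEvaluator(service_context=service_context)
print(faithfulness.evaluate_response(response=response).passing)
print(relevancy.evaluate_response(query=query, response=response).passing)
```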
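And a sketch of hybrid search with Postgres/pgvector; the connection details are placeholders, and the parameter names assume the PGVectorStore API at the time of this release.

```python
from llama_index import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.vector_stores import PGVectorStore

# Connection details are placeholders; hybrid_search=True provisions a text
# search column alongside the vector column so keyword matching can be used.
vector_store = PGVectorStore.from_params(
    database="vector_db",
    host="localhost",
    password="password",
    port=5432,
    user="postgres",
    table_name="llama_hybrid_demo",
    embed_dim=1536,  # must match the embedding model's dimension
    hybrid_search=True,
    text_search_config="english",
)

storage_context = StorageContext.from_defaults(vector_store=vector_store)
documents = SimpleDirectoryReader("data").load_data()  # placeholder folder
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)

# "hybrid" mode combines dense vector similarity with keyword text search.
query_engine = index.as_query_engine(vector_store_query_mode="hybrid", sparse_top_k=2)
print(query_engine.query("Who are the parties to the agreement?"))
```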
LlamaIndex.TS:
- A new release adds MongoDBReader and type-safe metadata. Tweet.
- Another release adds chat history, an enhanced keyword index, and Notion DB support. Tweet.
Fine-Tuning Guides:
- OpenAI Fine-Tuning: LlamaIndex unveils a fresh guide on harnessing OpenAI fine-tuning to embed knowledge from any text corpus. In short: generate QA pairs with GPT-4, format them into a training dataset, and proceed to fine-tuning (a condensed sketch follows this list). Tweet, Docs.
- Embedding Fine-Tuning: LlamaIndex has a more advanced embedding fine-tuning feature, enabling complex neural-network query transformations on top of any embedding, including custom ones, and the ability to save intermediate checkpoints for enhanced model control (also sketched after this list). Tweet, Docs.
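A condensed sketch of the OpenAI fine-tuning flow described above (answer generated questions with GPT-4, log the exchanges as training events, then fine-tune gpt-3.5-turbo). It assumes the llama_index fine-tuning and callback APIs from around this release and elides the question-generation step.

```python
from llama_index import ServiceContext
from llama_index.llms import OpenAI
from llama_index.callbacks import CallbackManager, OpenAIFineTuningHandler
from llama_index.finetuning import OpenAIFinetuneEngine

# 1) Answer your generated questions with GPT-4 while the handler records every
#    prompt/response pair in OpenAI's fine-tuning format.
finetuning_handler = OpenAIFineTuningHandler()
gpt4_context = ServiceContext.from_defaults(
    llm=OpenAI(model="gpt-4", temperature=0.3),
    callback_manager=CallbackManager([finetuning_handler]),
)
# ... run a query engine built with gpt4_context over the generated questions
#     here, then dump the captured events ...
finetuning_handler.save_finetuning_events("finetuning_events.jsonl")

# 2) Launch an OpenAI fine-tuning job for gpt-3.5-turbo on those events.
finetune_engine = OpenAIFinetuneEngine("gpt-3.5-turbo", "finetuning_events.jsonl")
finetune_engine.finetune()

# 3) Use the fine-tuned model like any other LLM in LlamaIndex.
ft_llm = finetune_engine.get_finetuned_model(temperature=0.3)
```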
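And a sketch of the linear-adapter embedding fine-tuning mentioned both here and in the feature list above; the folder, chunk size, and base model are placeholders, and the calls assume the llama_index.finetuning API at the time.

```python
from llama_index import SimpleDirectoryReader
from llama_index.node_parser import SimpleNodeParser
from llama_index.embeddings import resolve_embed_model
from llama_index.finetuning import (
    EmbeddingAdapterFinetuneEngine,
    generate_qa_embedding_pairs,
)

# Chunk a corpus and auto-generate (question, context) training pairs.
docs = SimpleDirectoryReader("data").load_data()  # placeholder folder
nodes = SimpleNodeParser.from_defaults(chunk_size=512).get_nodes_from_documents(docs)
train_dataset = generate_qa_embedding_pairs(nodes)

# Train a linear adapter on top of a frozen base embedding model. Existing
# document embeddings can be reused, since the adapter only transforms the
# query embedding at query time.
base_embed_model = resolve_embed_model("local:BAAI/bge-small-en")
finetune_engine = EmbeddingAdapterFinetuneEngine(
    train_dataset,
    base_embed_model,
    model_output_path="adapter_model",
    epochs=4,
    verbose=True,
)
finetune_engine.finetune()

# The returned embedding model applies the adapter and can be plugged into a
# ServiceContext for retrieval.
embed_model = finetune_engine.get_finetuned_model()
```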
Retrieval Tips For RAG:
- Use references (smaller chunks or summaries) instead of embedding full text (a sketch follows this list).
- Results in a 10–20% improvement.
- Embeddings decoupled from main text chunks.
- Smaller references allow efficient LLM synthesis.
- Deduplication applied for repetitive references.
- Evaluated on a synthetic dataset; 20–25% MRR boost.
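Here is a minimal sketch of the “smaller reference chunks” tip, using IndexNode references resolved by a RecursiveRetriever; the chunk sizes and data folder are placeholders, and the calls assume the llama_index API available around this update.

```python
from llama_index import SimpleDirectoryReader, VectorStoreIndex
from llama_index.node_parser import SimpleNodeParser
from llama_index.schema import IndexNode
from llama_index.retrievers import RecursiveRetriever
from llama_index.query_engine import RetrieverQueryEngine

documents = SimpleDirectoryReader("data").load_data()  # placeholder folder

# "Base" chunks are what the LLM reads at synthesis time.
base_parser = SimpleNodeParser.from_defaults(chunk_size=1024)
base_nodes = base_parser.get_nodes_from_documents(documents)

# For every base chunk, embed smaller reference chunks that point back to the
# parent chunk via index_id.
sub_parser = SimpleNodeParser.from_defaults(chunk_size=256)
all_nodes = []
for base_node in base_nodes:
    for sub_node in sub_parser.get_nodes_from_documents([base_node]):
        all_nodes.append(IndexNode.from_text_node(sub_node, base_node.node_id))
    all_nodes.append(IndexNode.from_text_node(base_node, base_node.node_id))

# Retrieval happens over the small references; hits are resolved back to the
# full parent chunks before synthesis.
vector_index = VectorStoreIndex(all_nodes)
retriever = RecursiveRetriever(
    "vector",
    retriever_dict={"vector": vector_index.as_retriever(similarity_top_k=2)},
    node_dict={n.node_id: n for n in all_nodes},
)
query_engine = RetrieverQueryEngine.from_args(retriever)
print(query_engine.query("What were the key findings?"))
```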
Building RAG from Scratch Guides:
- Build Data Ingestion from scratch. Docs.
- Build Retrieval from scratch (a minimal sketch follows this list). Docs.
- Build Vector Store from scratch. Docs.
- Build Response Synthesis from scratch. Docs.
- Build Router from scratch. Docs.
- Build Evaluation from scratch. Docs.
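As a flavor of the from-scratch guides, here is a hand-rolled top-k retriever built directly on an embedding model and cosine similarity; the chunk texts are placeholders, and only OpenAIEmbedding is borrowed from llama_index.

```python
import numpy as np
from llama_index.embeddings import OpenAIEmbedding

# A hand-rolled top-k retriever: embed the chunks once, embed each query, and
# rank by cosine similarity. The chunk texts are placeholders.
chunks = [
    "LlamaIndex provides data connectors for many sources.",
    "RAG pipelines retrieve relevant context before calling the LLM.",
    "pgvector adds vector similarity search to Postgres.",
]

embed_model = OpenAIEmbedding()
chunk_embeddings = np.array([embed_model.get_text_embedding(c) for c in chunks])

def retrieve(query, top_k=2):
    q = np.array(embed_model.get_query_embedding(query))
    # Cosine similarity between the query vector and every chunk vector.
    sims = chunk_embeddings @ q / (
        np.linalg.norm(chunk_embeddings, axis=1) * np.linalg.norm(q)
    )
    return [chunks[i] for i in np.argsort(sims)[::-1][:top_k]]

print(retrieve("How do I do vector search in Postgres?"))
```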
Tutorials:
- Wenqi Glantz’s tutorial on Fine-Tuning GPT-3.5 RAG Pipeline with GPT-4 Training Data, using LlamaIndex fine-tuning abstractions.
- Wenqi Glantz’s tutorial on Fine-Tuning Your Embedding Model to Maximize Relevance Retrieval in RAG Pipeline with LlamaIndex.
Tutorials from the LlamaIndex Team:
- Sourabh’s tutorial on SEC Insights, an end-to-end guide to secinsights.ai.
- Adam’s tutorial on Custom Tools for Data Agents.
- Logan’s tutorial on retrieval/reranking, covering node parsing, AutoMergingRetriever, HierarchicalNodeParser, node postprocessors, and the setup of a RouterQueryEngine (a minimal sketch follows).
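For the components covered in that tutorial, here is a minimal sketch of HierarchicalNodeParser plus AutoMergingRetriever, assuming the llama_index API at the time; the chunk sizes and data folder are placeholders.

```python
from llama_index import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.node_parser import HierarchicalNodeParser, get_leaf_nodes
from llama_index.retrievers.auto_merging_retriever import AutoMergingRetriever
from llama_index.query_engine import RetrieverQueryEngine
from llama_index.storage.docstore import SimpleDocumentStore

documents = SimpleDirectoryReader("data").load_data()  # placeholder folder

# Parse a hierarchy of chunks (sizes are placeholders); only the smallest
# "leaf" chunks are embedded.
node_parser = HierarchicalNodeParser.from_defaults(chunk_sizes=[2048, 512, 128])
nodes = node_parser.get_nodes_from_documents(documents)
leaf_nodes = get_leaf_nodes(nodes)

# The docstore keeps the full hierarchy so parent chunks can be looked up later.
docstore = SimpleDocumentStore()
docstore.add_documents(nodes)
storage_context = StorageContext.from_defaults(docstore=docstore)

# Retrieve leaf chunks, then auto-merge them into their parent chunk whenever
# enough sibling chunks are retrieved together.
base_index = VectorStoreIndex(leaf_nodes, storage_context=storage_context)
base_retriever = base_index.as_retriever(similarity_top_k=6)
retriever = AutoMergingRetriever(base_retriever, storage_context, verbose=True)

query_engine = RetrieverQueryEngine.from_args(retriever)
print(query_engine.query("Summarize the safety discussion."))
```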
Integrations with External Platforms
- Integration with PortkeyAI: LlamaIndex integrates with PortkeyAI, adding automatic fallbacks and load balancing across LLM providers such as OpenAI. Tweet, Documentation
- Collaboration with Anyscale: LlamaIndex collaborates with Anyscale, enabling easy tuning of open-source LLMs using Ray Serve/Train. Tweet, Documentation
- Integration with Elastic: LlamaIndex integrates with Elastic, enhancing capabilities such as vector search, text search, hybrid search models, enhanced metadata handling, and es_filters. Tweet, Documentation
- Integration with MultiOn: LlamaIndex integrates with MultiOn, enabling data agents to navigate the web and handle tasks via an LLM-designed browser. Tweet, Documentation
- Integration with Vectara: LlamaIndex collaborates with Vectara to streamline RAG processes from loaders to databases. Tweet, Blog Post
- Integration with LiteLLM: LlamaIndex integrates with LiteLLM, offering access to over 100 LLM APIs with support for chat, streaming, and async operations (a sketch follows this list). Tweet, Documentation
- Integration with MonsterAPI: LlamaIndex integrates with MonsterAPI, allowing users to query data using LLMs like Llama 2 and Falcon. Tweet, Blog Post
Events:
- Jerry Liu spoke on Production-Ready LLM Applications at the Arize AI event.
- Ravi Theja conducted a workshop at LlamaIndex + Replit Pune Generative AI meetup.
- Jerry Liu’s session on Building a Lending Criteria Chatbot in Production with Stelios from MQube.