LlamaIndex

LlamaIndex 2024-03-19

LlamaIndex Newsletter 2024-03-19

Greetings, LlamaIndex enthusiasts! 🦙

Welcome to another exciting weekly update from the world of LlamaVerse!

We have some amazing news for you from LlamaIndex: we've officially launched LlamaParse, a GenAI-native document parsing solution. With state-of-the-art table and chart extraction, instructions you can steer in natural language, and compatibility with over a dozen document types, LlamaParse excels at creating accurate RAG applications from complex documents. After a successful private preview with 2k users and 1M pages parsed, it's now ready to transform your document handling. Check out our launch post for all the details!

🤩 The highlights:

  1. New observability with Instrumentation: Enhanced developer workflow with a new Instrumentation module for improved observability. Docs, Tweet.
  2. LlamaParse accepts natural language parsing instructions: Easily extract math snippets from PDFs into LaTeX with LlamaParse. Blogpost, Tweet.
  3. Financial Data Parsing: Use LlamaParse to extract and interpret complex financial data from .pptx files, enabling detailed and accurate financial analysis over PowerPoint decks. Notebook, Tweet.

✨ Feature Releases and Enhancements:

  • We introduced LlamaIndex v0.10.20, featuring our new Instrumentation module, a leap in observability that simplifies developer workflows. It provides a module-level dispatcher, which reduces the need for individual callback managers and makes it easier to attach a comprehensive set of handlers across your application. Docs, Tweet.
  • We have launched a parsing-by-prompting feature in LlamaParse that extracts math snippets from PDFs into LaTeX, so you can plug the output directly into your RAG pipeline. Blogpost, Tweet.
  • We have launched an advanced RAG pipeline for Financial PowerPoints, using LlamaParse to tackle the challenge of parsing .pptx files. Our solution accurately extracts slides, including text, tables, and charts, enabling precise question-answering over complex financial data. Notebook, Tweet.
  • We collaborated with Langfuse to launch open-source observability for your RAG pipeline, enhancing your application with integrated tracing, prompt management, and evaluation in just two lines of code. Blogpost, Docs, Tweet.
  • Search-in-the-Chain, a method by Shicheng Xu et al., is now integrated into LlamaIndex, enhancing question-answering with an advanced system that interleaves retrieval and planning. This approach verifies each reasoning step in a chain, allowing for dynamic replanning and application in various agent reasoning contexts. LlamaPack, Tweet.
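
To make the module-level dispatcher idea from the Instrumentation item more concrete, here is a minimal, standard-library-only sketch of the pattern. It is an illustration under stated assumptions, not the LlamaIndex API: the names `Event`, `Dispatcher`, and `get_dispatcher` as written here, and the rule that events propagate up to a root dispatcher, are all hypothetical. Please refer to the linked Docs for the actual module.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Event:
    """Hypothetical event record: a name plus an arbitrary payload."""
    name: str
    payload: dict = field(default_factory=dict)


class Dispatcher:
    """Hypothetical dispatcher: runs its own handlers, then forwards to its parent."""

    def __init__(self, name: str, parent: Optional["Dispatcher"] = None) -> None:
        self.name = name
        self.parent = parent
        self.handlers: List[Callable[[Event], None]] = []

    def add_handler(self, handler: Callable[[Event], None]) -> None:
        self.handlers.append(handler)

    def emit(self, event: Event) -> None:
        for handler in self.handlers:
            handler(event)
        # Propagate upward so handlers on the root see every event.
        if self.parent is not None:
            self.parent.emit(event)


# One registry for the whole application; "root" is the shared parent.
_ROOT = Dispatcher("root")
_REGISTRY: Dict[str, Dispatcher] = {"root": _ROOT}


def get_dispatcher(name: str = "root") -> Dispatcher:
    """Return the dispatcher for a module name, creating it on first use."""
    if name not in _REGISTRY:
        _REGISTRY[name] = Dispatcher(name, parent=_ROOT)
    return _REGISTRY[name]


# Attach a single handler at the root instead of one callback manager per component.
seen: List[str] = []
get_dispatcher().add_handler(lambda event: seen.append(event.name))

# A module (here a made-up "my_app.retriever") emits through its own dispatcher.
retriever_dispatcher = get_dispatcher("my_app.retriever")
retriever_dispatcher.emit(Event("retrieval_start", {"query": "What is RAG?"}))
retriever_dispatcher.emit(Event("retrieval_end", {"num_nodes": 3}))

print(seen)
```

The design choice this sketch highlights is that handlers are registered once, at the root, while each module only needs to know its own dispatcher. That is the sense in which a module-level dispatcher can replace per-component callback managers.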

🎥 Demos:

  • Home AI is a tool created with create-llama that helps with home searches. It uses LLMs to automate the parsing of complex property disclosures, enabling users to filter searches with far more detail and efficiency. Blogpost, Code, Tweet.

🗺️ Guides:

  • Guide to using LlamaIndex and Mathpix to parse, index, and query complex mathematics within scientific papers. It details the steps from parsing tables and extracting images to indexing in a RAG app and answering questions with precise LaTeX outputs, showcasing a hierarchical retrieval technique.

✍️ Tutorials:

🎥 Webinars:

  • Webinar with Charles Packer, lead author of MemGPT, on Long-Term, Self-Editing Memory with MemGPT.

📅 Events:

  • We are hosting a RAG meetup in Paris on March 27th featuring talks on advanced RAG strategies, building a RAG CLI, and the significance of open-source RAG in business.