
Welcome to ContextGem Documentation!#
ContextGem is a free, open-source LLM framework that makes it radically easier to extract structured data and insights from documents — with minimal code.
Learn about the motivation, comparisons with other frameworks, and how ContextGem works.
Instructions to install ContextGem and quickly start using it.
Learn how to create and configure documents to extract information (aspects and concepts) from.
Learn how to use ContextGem’s built-in document converters for files such as DOCX.
Learn how to identify and extract specific document sections like clauses, chapters, or terms using ContextGem’s Aspects API.
Learn how to extract and infer structured data like JSON objects, strings, numbers, dates, booleans, ratings, and labels from documents using ContextGem’s Concepts API.
Learn how to create reusable extraction pipelines that combine aspects and concepts for consistent document analysis across multiple files.
Learn about supported cloud LLM providers and local models, and how to configure and use them for extraction.
Explore advanced features and techniques for extracting data from documents.
Learn how to optimize your extraction pipeline for accuracy, cost, and performance.
Learn how to serialize and deserialize ContextGem objects for storage and transfer.
Complete API documentation for all ContextGem modules and classes.