
Welcome to ContextGem Documentation!
ContextGem is a free, open-source LLM framework that makes it radically easier to extract structured data and insights from documents with minimal code.
Learn about the motivation, comparisons with other frameworks, and how ContextGem works.
Learn how to install ContextGem and quickly start using it.
Learn how to identify and extract specific document sections like clauses, chapters, or terms using ContextGem’s Aspects API.
Learn how to extract and infer structured data like JSON objects, strings, numbers, dates, booleans, ratings, and labels from documents using ContextGem’s Concepts API.
Learn about supported cloud LLM providers and local models, and how to configure and use them for extraction.
Learn how to use ContextGem’s built-in document converters for files such as DOCX.
Explore advanced features and techniques for extracting data from documents.
Learn how to optimize your extraction pipeline for accuracy, cost, and performance.
Learn how to serialize and deserialize ContextGem objects for storage and transfer.
Complete API documentation for all ContextGem modules and classes.