Welcome to ContextGem Documentation!

ContextGem logo

Welcome to ContextGem Documentation!#

ContextGem is a free, open-source LLM framework that makes it radically easier to extract structured data and insights from documents — with minimal code.


📚 Project Description

Learn about the motivation, comparisons with other frameworks, and how ContextGem works.

Why ContextGem?
🚀 Getting Started

Instructions to install ContextGem and quickly start using it.

Installation
📋 Extracting Aspects

Learn how to identify and extract specific document sections like clauses, chapters, or terms using ContextGem’s Aspects API.

Aspect Extraction
🧩 Extracting Concepts

Learn how to extract and infer structured data like JSON objects, strings, numbers, dates, booleans, ratings, and labels from documents using ContextGem’s Concepts API.

Supported Concepts
🤖 Large Language Models

Learn about supported cloud LLM providers and local models, and how to configure and use them for extraction.

Supported LLMs
🔄 Document Converters

Learn how to use ContextGem’s built-in document converters for files such as DOCX.

DOCX Converter
🔍 Advanced Usage

Explore advanced features and techniques for extracting data from documents.

Advanced usage examples
⚙️ Optimization Guide

Learn how to optimize your extraction pipeline for accuracy, cost, and performance.

Choosing the Right LLM(s)
💾 Serialization

Learn how to serialize and deserialize ContextGem objects for storage and transfer.

Serializing objects and results
📖 API Reference

Complete API documentation for all ContextGem modules and classes.

Documents

Indices and tables#