Introduction

This page describes the Python API for the Text Extensions for Pandas library. Text Extensions for Pandas adds NLP-specific data types, operations, and library integrations to Pandas. These extensions make it easier to manipulate and analyze NLP-related data with Pandas.

This documentation is organized into the following sections:

Pandas Extension Types

Pandas extension types for representing spans and tensors inside Pandas DataFrames.

Input and Output

Integrations with various NLP libraries and NLP-specific file formats for converting various types of NLP data into Pandas DataFrames.

Spanner Algebra

Span-specific operations for Pandas DataFrames, based on the Document Spanners formalism, also known as spanner algebra.

Support for Jupyter Notebooks

Support for displaying and editing data in

Indices and tables