Text Analysis

This interactive text analysis dashboard explores the language used in newspaper coverage of the Trail of Broken Treaties and BIA occupation. The analysis includes 93 documents containing 44,588 total words and 7,468 unique terms (after removing common stopwords).

Use the visualizations below to explore word frequencies, temporal patterns, and keywords in context across the archival sources. For a full description of the corpus, data pipeline, and methods, see the Methods and Data Pipeline page.

Word Frequency

The 50 most frequently occurring words in the newspaper coverage (stopwords removed). Words like "indian," "government," "protesters," and "treaty" dominate the discourse.

Bigram Frequency

The 30 most frequent two-word phrases (stopwords removed). Bigrams reveal recurring pairings like "bureau indian," "trail broken," and "white house" that shaped how events were framed.

Trigram Frequency

The 25 most frequent three-word phrases. Trigrams surface specific named entities and recurring formulations — "bureau indian affairs," "trail broken treaties," and "american indian movement" — that anchored media coverage.

Document Statistics

Distribution of documents by publication source.

Word Cloud

Visual representation of the 50 most common words. Click any word to see it in context below.

Keywords in Context (KWIC)

Search for any term and see where it appears in the documents with surrounding context.