HistText Manual
1 Introduction
2 Set Up
2.1 Installation and configuration
2.2 Available Corpora
2.2.1 Statistics
3 Query functions
3.1 Basic search
3.2 Basic concordance
3.3 Full Text Retrieval
3.4 Close reading
3.4.1 ProQuest Documents
3.4.2 Other documents
3.5 Advanced functions
3.5.1 Multifield queries
3.5.2 Date filtering
3.5.3 Concordance
3.5.4 Word embeddings
4 Corpus Statistics
4.1 Word frequencies
4.1.1 Number of occurrences per article
4.1.2 Mean number of occurrences over time
4.1.3 Percentage compared to other words over time
4.1.4 Plot or dataframe
4.1.5 Interactivity
4.2 Characters count
4.3 Date-related statistics
4.4 Document Term Matrix (DTM)
4.4.1 Top words
4.4.2 Top words over time
4.4.3 Document Similarity
5 Named Entity Recognition (NER)
5.1 Named Entity Extraction
5.2 Padagraph Visualization
5.3 NER on external documents
6 Question & Answer
6.1 Basic usage
6.2 More complex usage
7 Chinese-specific functions
7.1 Tokenization
7.1.1 Corpus
7.1.2 Data frame
7.2 Conversions
8 Additional features
8.1 Regular Expressions
8.2 Data transformation
9 Appendix
10 Further documentation
References
10 Further documentation
ENP-China R package: Update 0.2.5. 08-06-2021, by Jeremy Auguste.
HistText Updates (1.0.0). 07-10-2021. 08-06-2021, by Jeremy Auguste.
HistText 1.6.2: Updates & News. 27-09-2022, by Jeremy Auguste.