corpustools: Managing, Querying and Analyzing Tokenized Text
Introduction | Creating a tcorpus | Creating a tcorpus from full-text | Additional options | Importing a tokenlist | Managing a tCorpus | Adding, removing and mutating columns | Subsetting a tCorpus | Deduplication | Preprocessing | Basic preprocessing | Advanced preprocessing with UDPipe | Create_tcorpus keeps a persistent cache | Using multiple cores | Filtering tokens | Creating a DTM or DFM | Why keep the full corpus intact? | Querying the tcorpus | search_features() | Counting hits and plotting | Associations | Inspect results in full text | Adding query hits as token features | search_contexts() | Subset by search_contexts() | search_dictionary | Text analysis techniques | Semantic networks based on co-occurrence | Corpus comparisons | Feature associations | Using the tcorpus R6 methods | Being careful with shallow copies | Copying a tCorpus
