A framework for statistical analysis in content analysis. In addition to a pipeline for preprocessing text corpora and linking to the latent Dirichlet allocation from the 'lda' package, plots are offered for the descriptive analysis of text corpora and topic models. In addition, an implementation of Chang's intruder words and intruder topics is provided. Sample data for the vignette is included in the toscaData package, which is available on gitHub: <https://github.com/Docma-TU/toscaData>.
Version: | 0.3-2 |
Depends: | R (≥ 3.5.0) |
Imports: | tm (≥ 0.7-5), lda (≥ 1.4.2), quanteda (≥ 1.4.0), lubridate (≥ 1.7.3), htmltools (≥ 0.3.6), RColorBrewer (≥ 1.1-2), stringr (≥ 1.3.1), WikipediR (≥ 1.5.0), data.table (≥ 1.11.4) |
Suggests: | toscaData, testthat (≥ 2.0.0), knitr (≥ 1.20), devtools (≥ 1.13), rmarkdown (≥ 1.9) |
Published: | 2021-10-28 |
DOI: | 10.32614/CRAN.package.tosca |
Author: | Lars Koppers |
Maintainer: | Lars Koppers <koppers at statistik.tu-dortmund.de> |
License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
URL: | https://github.com/Docma-TU/tosca, https://doi.org/10.5281/zenodo.3591068 |
NeedsCompilation: | no |
Citation: | tosca citation info |
CRAN checks: | tosca results [issues need fixing before 2025-04-22] |
Reference manual: | tosca.pdf |
Vignettes: |
Vignette tosca |
Package source: | tosca_0.3-2.tar.gz |
Windows binaries: | r-devel: tosca_0.3-2.zip, r-release: tosca_0.3-2.zip, r-oldrel: tosca_0.3-2.zip |
macOS binaries: | r-devel (arm64): tosca_0.3-2.tgz, r-release (arm64): tosca_0.3-2.tgz, r-oldrel (arm64): tosca_0.3-2.tgz, r-devel (x86_64): tosca_0.3-2.tgz, r-release (x86_64): tosca_0.3-2.tgz, r-oldrel (x86_64): tosca_0.3-2.tgz |
Old sources: | tosca archive |
Reverse imports: | rollinglda |
Reverse suggests: | ldaPrototype |
Please use the canonical form https://CRAN.R-project.org/package=tosca to link to this page.