Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allows for tokenization, filtering, counting (tf and tfidf) and feature hashing.
Version: | 1.0.6 |
Depends: | R (≥ 3.6), recipes (≥ 1.0.7) |
Imports: | lifecycle, dplyr, generics (≥ 0.1.0), magrittr, Matrix, purrr, rlang, SnowballC, tibble, tokenizers, vctrs, glue |
LinkingTo: | cpp11 |
Suggests: | covr, data.table, dials (≥ 1.2.0), hardhat, janitor, knitr, modeldata, rmarkdown, sentencepiece, spacyr, stopwords, stringi, testthat (≥ 3.0.0), text2vec, tokenizers.bpe, udpipe, wordpiece |
Published: | 2023-11-15 |
DOI: | 10.32614/CRAN.package.textrecipes |
Author: | Emil Hvitfeldt [aut, cre], Michael W. Kearney [cph] (author of count_functions), Posit Software, PBC [cph, fnd] |
Maintainer: | Emil Hvitfeldt <emil.hvitfeldt at posit.co> |
BugReports: | https://github.com/tidymodels/textrecipes/issues |
License: | MIT + file LICENSE |
URL: | https://github.com/tidymodels/textrecipes, https://textrecipes.tidymodels.org/ |
NeedsCompilation: | yes |
SystemRequirements: | "GNU make" |
Materials: | README NEWS |
CRAN checks: | textrecipes results |
Reference manual: | textrecipes.pdf |
Vignettes: |
Working with n-grams Cookbook - Using more complex recipes involving text Under the hood - tokenlist |
Package source: | textrecipes_1.0.6.tar.gz |
Windows binaries: | r-devel: textrecipes_1.0.6.zip, r-release: textrecipes_1.0.6.zip, r-oldrel: textrecipes_1.0.6.zip |
macOS binaries: | r-release (arm64): textrecipes_1.0.6.tgz, r-oldrel (arm64): textrecipes_1.0.6.tgz, r-release (x86_64): textrecipes_1.0.6.tgz, r-oldrel (x86_64): textrecipes_1.0.6.tgz |
Old sources: | textrecipes archive |
Please use the canonical form https://CRAN.R-project.org/package=textrecipes to link to this page.