textclean: Text Cleaning Tools
Tools to clean and process text. Tools are geared at checking for substrings that
are not optimal for analysis and replacing or removing them (normalizing) with more
analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards
(2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For
example, emoticons are often used in text but not always easily handled by analysis
algorithms. The replace_emoticon() function replaces emoticons with word
equivalents.
Version: |
0.9.3 |
Depends: |
R (≥ 3.4.0) |
Imports: |
data.table, english (≥ 1.0-2), glue (≥ 1.3.0), lexicon (≥
1.0.0), mgsub (≥ 1.5.0), qdapRegex, stringi, textshape (≥
1.0.1), utils |
Suggests: |
testthat |
Published: |
2018-07-23 |
DOI: |
10.32614/CRAN.package.textclean |
Author: |
Tyler Rinker [aut, cre],
ctwheels StackOverflow [ctb] |
Maintainer: |
Tyler Rinker <tyler.rinker at gmail.com> |
BugReports: |
https://github.com/trinker/textclean/issues |
License: |
GPL-2 |
URL: |
https://github.com/trinker/textclean |
NeedsCompilation: |
no |
Citation: |
textclean citation info |
Materials: |
README NEWS |
CRAN checks: |
textclean results |
Documentation:
Downloads:
Reverse dependencies:
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=textclean
to link to this page.