Using tidy data principles can make many text mining tasks
easier, more effective, and consistent with tools already in wide use.
Much of the infrastructure needed for text mining with tidy data
frames already exists in packages like 'dplyr', 'broom', 'tidyr', and
'ggplot2'. In this package, we provide functions and supporting data
sets to allow conversion of text to and from tidy formats, and to
switch seamlessly between tidy tools and existing text mining
packages.
Version: |
0.4.2 |
Depends: |
R (≥ 2.10) |
Imports: |
cli, dplyr (≥ 1.1.1), generics, janeaustenr, lifecycle, Matrix, methods, purrr (≥ 0.1.1), rlang (≥ 0.4.10), stringr, tibble, tokenizers, vctrs |
Suggests: |
broom, covr, data.table, ggplot2, hunspell, knitr, mallet, NLP, quanteda, readr, reshape2, rmarkdown, scales, stm, stopwords, testthat (≥ 2.1.0), textdata, tidyr, tm, topicmodels, vdiffr, wordcloud |
Published: |
2024-04-10 |
DOI: |
10.32614/CRAN.package.tidytext |
Author: |
Gabriela De Queiroz [ctb],
Colin Fay [ctb],
Emil Hvitfeldt [ctb],
Os Keyes [ctb],
Kanishka Misra [ctb],
Tim Mastny [ctb],
Jeff Erickson [ctb],
David Robinson [aut],
Julia Silge [aut,
cre] |
Maintainer: |
Julia Silge <julia.silge at gmail.com> |
BugReports: |
https://github.com/juliasilge/tidytext/issues |
License: |
MIT + file LICENSE |
URL: |
https://juliasilge.github.io/tidytext/,
https://github.com/juliasilge/tidytext |
NeedsCompilation: |
no |
Citation: |
tidytext citation info |
Materials: |
README, NEWS |
In views: |
NaturalLanguageProcessing |
CRAN checks: |
tidytext results [issues need fixing before 2025-09-03] |