Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools [R package tidytext version 0.4.3]

tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools

Using tidy data principles can make many text mining tasks easier, more effective, and consistent with tools already in wide use. Much of the infrastructure needed for text mining with tidy data frames already exists in packages like 'dplyr', 'broom', 'tidyr', and 'ggplot2'. In this package, we provide functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages.

Version:	0.4.3
Depends:	R (≥ 2.10)
Imports:	cli, dplyr (≥ 1.1.1), generics, janeaustenr, lifecycle, Matrix, methods, purrr (≥ 0.1.1), rlang (≥ 0.4.10), stringr, tibble, tokenizers, vctrs
Suggests:	broom, covr, data.table, ggplot2, hunspell, knitr, mallet, NLP, quanteda, readr, reshape2, rmarkdown, scales, stm, stopwords, testthat (≥ 2.1.0), textdata, tidyr, tm, topicmodels, vdiffr, wordcloud
Published:	2025-07-25
DOI:	10.32614/CRAN.package.tidytext
Author:	Gabriela De Queiroz [ctb], Colin Fay [ctb], Emil Hvitfeldt [ctb], Os Keyes [ctb], Kanishka Misra [ctb], Tim Mastny [ctb], Jeff Erickson [ctb], David Robinson [aut], Julia Silge [aut, cre]
Maintainer:	Julia Silge <julia.silge at gmail.com>
BugReports:	https://github.com/juliasilge/tidytext/issues
License:	MIT + file LICENSE
URL:	https://juliasilge.github.io/tidytext/, https://github.com/juliasilge/tidytext
NeedsCompilation:	no
Citation:	tidytext citation info
Materials:	README, NEWS
In views:	NaturalLanguageProcessing
CRAN checks:	tidytext results

Documentation:

Reference manual:	tidytext.html , tidytext.pdf
Vignettes:	Tidy Term Frequency and Inverse Document Frequency (tf-idf) (source, R code) Converting to and from Document-Term Matrix and Corpus objects (source, R code) Introduction to tidytext (source, R code)

Downloads:

Package source:	tidytext_0.4.3.tar.gz
Windows binaries:	r-devel: tidytext_0.4.3.zip, r-release: tidytext_0.4.3.zip, r-oldrel: tidytext_0.4.3.zip
macOS binaries:	r-release (arm64): tidytext_0.4.3.tgz, r-oldrel (arm64): tidytext_0.4.3.tgz, r-release (x86_64): tidytext_0.4.3.tgz, r-oldrel (x86_64): tidytext_0.4.3.tgz
Old sources:	tidytext archive

Reverse dependencies:

Reverse depends:	textanalyzer
Reverse imports:	abe, akc, AnimalSequences, available, bibliometrix, CINE, contentanalysis, dail, datamedios, DedooseR, DisasterAlert, DistatisR, DOPE, ggpage, Goodreader, GSEAmining, iheiddown, MadanText, MadanTextNetwork, margaret, miaViz, moodleR, NIMAA, opitools, sherlock, SportMiner, statquotes, sumup, texter, TextForecast, TextMiningGUI, tidylda, tsentiment, ulex, upstartr, vivainsights, WeatherSentiment, weed, widyr, wpa
Reverse suggests:	birddog, bugphyzz, eurlex, funrar, fwtraits, gutenbergr, LexisNexisTools, meetupr, MetMashR, mvrsquared, newsanchor, openintro, polmineR, rfars, schrute, SETA, smartid, spacyr, spRingsteen, textAnnotatoR, textmineR, tidyfst, tidylo, tidypmc
Reverse enhances:	quanteda

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tidytext to link to this page.