Package: cleanNLP 3.1.0
cleanNLP: A Tidy Data Model for Natural Language Processing
Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or a Python back ends with 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing.
Authors:
cleanNLP_3.1.0.tar.gz
cleanNLP_3.1.0.zip(r-4.5)cleanNLP_3.1.0.zip(r-4.4)cleanNLP_3.1.0.zip(r-4.3)
cleanNLP_3.1.0.tgz(r-4.4-any)cleanNLP_3.1.0.tgz(r-4.3-any)
cleanNLP_3.1.0.tar.gz(r-4.5-noble)cleanNLP_3.1.0.tar.gz(r-4.4-noble)
cleanNLP_3.1.0.tgz(r-4.4-emscripten)cleanNLP_3.1.0.tgz(r-4.3-emscripten)
cleanNLP.pdf |cleanNLP.html✨
cleanNLP/json (API)
NEWS
# Install 'cleanNLP' in R: |
install.packages('cleanNLP', repos = c('https://statsmaths.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/statsmaths/cleannlp/issues
- un - Universal Declaration of Human Rights
- word_frequency - Most frequent English words
corenlpnatural-language-processingspacy
Last updated 6 months agofrom:0e6bf7d8f6. Checks:OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 16 2024 |
R-4.5-win | OK | Nov 16 2024 |
R-4.5-linux | OK | Nov 16 2024 |
R-4.4-win | OK | Nov 16 2024 |
R-4.4-mac | OK | Nov 16 2024 |
R-4.3-win | OK | Nov 16 2024 |
R-4.3-mac | OK | Nov 16 2024 |
Exports:cnlp_annotatecnlp_download_spacycnlp_init_spacycnlp_init_stringicnlp_init_udpipecnlp_utils_pcacnlp_utils_tfcnlp_utils_tfidf
Dependencies:data.tableherejsonlitelatticeMatrixpngrappdirsRcppRcppTOMLreticulaterlangrprojrootstringiudpipewithr
Creating Text Visualizations with Wikipedia Data
Rendered fromwikipedia.Rmd
usingknitr::rmarkdown
on Nov 16 2024.Last update: 2020-03-07
Started: 2019-10-22
Exploring the State of the Union Addresses: A Case Study with cleanNLP
Rendered fromstate-of-union.Rmd
usingknitr::rmarkdown
on Nov 16 2024.Last update: 2020-03-07
Started: 2019-10-22
Readme and manuals
Help Manual
Help page | Topics |
---|---|
cleanNLP: A Tidy Data Model for Natural Language Processing | cleanNLP-package cleanNLP |
Run the annotation pipeline on a set of documents | cnlp_annotate |
Download model files needed for spacy | cnlp_download_spacy |
Interface for initializing the spacy backend | cnlp_init_spacy |
Interface for initializing the standard R backend | cnlp_init_stringi |
Interface for initializing the udpipe backend | cnlp_init_udpipe |
Compute Principal Components and store as a Data Frame | cnlp_utils_pca |
Construct the TF-IDF Matrix from Annotation or Data Frame | cnlp_utils_tf cnlp_utils_tfidf |
Universal Declaration of Human Rights | un |
Most frequent English words | word_frequency |