pacman - Package Management Tool
Tools to more conveniently perform tasks associated with add-on packages. pacman conveniently wraps library and package related functions and names them in an intuitive and consistent fashion. It seeks to combine functionality from lower level functions which can speed up workflow.
Last updated
githubpackage-managementpackages
13.57 score 325 stars 8 dependents 16k scripts 64k downloadstextclean - Text Cleaning Tools
Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
Last updated
data-mungingemoticonsregextext-analysistext-cleaning
10.48 score 258 stars 28 dependents 848 scripts 6.5k downloadsqdap - Bridging the Gap Between Qualitative Data and Quantitative Analysis
Automates many of the tasks associated with quantitative discourse analysis of transcripts containing discourse including frequency counts of sentence types, words, sentences, turns of talk, syllables and other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions enable the user to aggregate data by any number of grouping variables, providing analysis and seamless integration with other R packages that undertake higher level analysis and visualization of text. This affords the user a more efficient and targeted analysis. 'qdap' is designed for transcript analysis, however, many functions are applicable to other areas of Text Mining/ Natural Language Processing.
Last updated
qdapquantitative-discourse-analysistext-analysistext-miningtext-plottingopenjdk
9.57 score 187 stars 5 dependents 1.5k scripts 1.4k downloadssentimentr - Calculate Text Polarity Sentiment
Calculate text polarity sentiment at the sentence level and optionally aggregate by rows or grouping variable(s).
Last updated
amplifierpolaritysentimentsentiment-analysisvalence-shifter
9.53 score 438 stars 4 dependents 772 scripts 1.9k downloadsqdapRegex - Regular Expression Removal, Extraction, and Replacement Tools
A collection of regular expression tools associated with the 'qdap' package that may be useful outside of the context of discourse analysis. Tools include removal/extraction/replacement of abbreviations, dates, dollar amounts, email addresses, hash tags, numbers, percentages, citations, person tags, phone numbers, times, and zip codes.
Last updated
qdapregexregular-expression
9.26 score 50 stars 41 dependents 532 scripts 11k downloadstextstem - Tools for Stemming and Lemmatizing Text
Tools that stem and lemmatize text. Stemming is a process that removes endings such as affixes. Lemmatization is the process of grouping inflected forms together as a single base form.
Last updated
lemmatizationstemmingtext-mining
9.12 score 46 stars 15 dependents 1.6k scripts 4.0k downloadstextshape - Tools for Reshaping Text
Tools that can be used to reshape and restructure text data.
Last updated
data-reshapingmanipulationsentence-boundary-detectiontext-datatext-formatingtidy
9.07 score 53 stars 42 dependents 400 scripts 8.9k downloadslexicon - Lexicons for Text Analysis
A collection of lexical hash tables, dictionaries, and word lists.
Last updated
hashlexiconlookupnames-frequentstopwordstext-dictionariestext-mining
8.98 score 113 stars 31 dependents 278 scripts 6.6k downloadswakefield - Generate Random Data Sets
Generates random data sets including: data.frames, lists, and vectors.
Last updated
data-generationwakefield
7.40 score 256 stars 189 scripts 1.7k downloadsqdapTools - Tools for the 'qdap' Package
A collection of tools associated with the 'qdap' package that may be useful outside of the context of text analysis.
Last updated
7.15 score 15 stars 7 dependents 390 scripts 1.2k downloadsnumform - Tools to Format Numbers for Publication
Format numbers and plots for publication; includes the removal of leading zeros, standardization of number of digits, addition of affixes, and a p-value formatter. These tools combine the functionality of several 'base' functions such as 'paste()', 'format()', and 'sprintf()' into specific use case functions that are named in a way that is consistent with usage, making their names easy to remember and easy to deploy.
Last updated
number-formating
6.14 score 52 stars 1 dependents 178 scripts 502 downloadsqdapDictionaries - Dictionaries and Word Lists for the 'qdap' Package
A collection of text analysis dictionaries and word lists for use with the 'qdap' package.
Last updated
6.08 score 4 stars 8 dependents 159 scripts 1.6k downloads