pacman - Package Management Tool
Tools for performing tasks associated with add-on packages more conveniently. pacman wraps library- and package-related functions and names them in an intuitive, consistent fashion. It combines functionality from lower-level functions to speed up workflow.
Last updated 5 years ago
github, package-management, packages
14.24 score 315 stars 10 dependents 14k scripts 285k downloads

textclean - Text Cleaning Tools
Tools to clean and process text. The tools check for substrings that are poorly suited to analysis and either replace or remove them (normalizing) in favor of more analysis-friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>), or extract them into new variables. For example, emoticons are often used in text but are not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
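The emoticon replacement described above can be illustrated with a minimal, language-neutral sketch. It is written in Python and is not the textclean API: the function name replace_emoticons and the four-entry lookup table are assumptions made for this example, and a real emoticon lexicon would be much larger.

```python
import re

# Hypothetical, tiny emoticon table (an assumption for illustration only).
EMOTICON_WORDS = {
    ":)": "smiley",
    ":(": "frown",
    ";)": "wink",
    ":D": "laughing",
}

# Try longer emoticons first so a short one never shadows a longer one.
_PATTERN = re.compile(
    "|".join(re.escape(e) for e in sorted(EMOTICON_WORDS, key=len, reverse=True))
)


def replace_emoticons(text: str) -> str:
    """Replace each known emoticon with its word equivalent."""
    return _PATTERN.sub(lambda m: EMOTICON_WORDS[m.group(0)], text)


print(replace_emoticons("Great talk :) but the room was cold :("))
# prints: Great talk smiley but the room was cold frown
```

Replacing rather than deleting keeps the signal the emoticon carried, which matters when the cleaned text later feeds a word-based analysis.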
Last updated 3 years ago
data-munging, emoticons, regex, text-analysis, text-cleaning
10.08 score 248 stars 22 dependents 760 scripts 6.5k downloads

qdap - Bridging the Gap Between Qualitative Data and Quantitative Analysis
Automates many of the tasks associated with quantitative discourse analysis of transcripts, including frequency counts of sentence types, words, sentences, turns of talk, and syllables, along with other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions let the user aggregate data by any number of grouping variables, providing analysis and seamless integration with other R packages that undertake higher-level analysis and visualization of text. This affords the user a more efficient and targeted analysis. 'qdap' is designed for transcript analysis; however, many functions are applicable to other areas of text mining and natural language processing.
Last updated 4 years ago
qdap, quantitative-discourse-analysis, text-analysis, text-mining, text-plotting, openjdk
9.61 score 176 stars 3 dependents 1.3k scripts 3.2k downloads

qdapRegex - Regular Expression Removal, Extraction, and Replacement Tools
A collection of regular expression tools associated with the 'qdap' package that may be useful outside of the context of discourse analysis. Tools include removal/extraction/replacement of abbreviations, dates, dollar amounts, email addresses, hash tags, numbers, percentages, citations, person tags, phone numbers, times, and zip codes.
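As a rough illustration of the removal and extraction pattern described above, here is a minimal Python sketch. It does not use qdapRegex: the helper names extract and remove, and both regular expressions, are assumptions chosen for brevity, and the patterns will miss edge cases that a curated expression library would handle.

```python
import re

# Deliberately simple patterns (assumptions, not the expressions shipped
# with qdapRegex).
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
HASHTAG = re.compile(r"#\w+")


def extract(pattern: re.Pattern, text: str) -> list:
    """Return every substring of `text` matched by `pattern`."""
    return pattern.findall(text)


def remove(pattern: re.Pattern, text: str) -> str:
    """Delete every match, then collapse the leftover whitespace."""
    return re.sub(r"\s+", " ", pattern.sub("", text)).strip()


text = "Write to jane.doe@example.com about #rstats and #nlp."
print(extract(EMAIL, text))    # ['jane.doe@example.com']
print(extract(HASHTAG, text))  # ['#rstats', '#nlp']
print(remove(EMAIL, text))     # Write to about #rstats and #nlp.
```

Keeping one compiled pattern per entity type lets the same expression drive removal and extraction, and replacement could reuse it the same way.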
Last updated 1 year ago
qdap, regex, regular-expression
9.48 score 50 stars 41 dependents 502 scripts 9.8k downloads

sentimentr - Calculate Text Polarity Sentiment
Calculate text polarity sentiment at the sentence level and optionally aggregate by rows or grouping variable(s).
Last updated 3 years ago
amplifier, polarity, sentiment, sentiment-analysis, valence-shifter
9.43 score 432 stars 2 dependents 680 scripts 3.4k downloads

textshape - Tools for Reshaping Text
Tools that can be used to reshape and restructure text data.
Last updated 12 months ago
data-reshaping, manipulation, sentence-boundary-detection, text-data, text-formating, tidy
9.18 score 50 stars 34 dependents 266 scripts 11k downloads

lexicon - Lexicons for Text Analysis
A collection of lexical hash tables, dictionaries, and word lists.
Last updated 3 years ago
hash, lexicon, lookup, names-frequent, stopwords, text-dictionaries, text-mining
8.80 score 111 stars 25 dependents 224 scripts 6.8k downloads

textstem - Tools for Stemming and Lemmatizing Text
Tools that stem and lemmatize text. Stemming is a process that removes endings such as affixes. Lemmatization is the process of grouping inflected forms together as a single base form.
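The difference between the two operations can be seen in a toy Python sketch. It is not the textstem API and not a real stemming algorithm: the suffix list, the lemma dictionary, and the function names stem and lemmatize are all assumptions made for illustration.

```python
# Toy suffix list and lemma dictionary (assumptions for illustration only).
SUFFIXES = ("ing", "ed", "es", "s")
LEMMAS = {"ran": "run", "running": "run", "runs": "run", "mice": "mouse"}


def stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def lemmatize(word: str) -> str:
    """Map an inflected form to its base form via dictionary lookup."""
    return LEMMAS.get(word, word)


words = ["running", "runs", "ran", "mice"]
print([stem(w) for w in words])       # ['runn', 'run', 'ran', 'mice']
print([lemmatize(w) for w in words])  # ['run', 'run', 'run', 'mouse']
```

The stemmer only trims endings, so irregular forms such as "ran" and "mice" pass through unchanged, while the lookup-based lemmatizer groups every inflected form under a single base form.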
Last updated 7 years ago
lemmatization, stemming, text-mining
8.71 score 45 stars 11 dependents 888 scripts 3.9k downloads

wakefield - Generate Random Data Sets
Generates random data sets, including data.frames, lists, and vectors.
Last updated 4 years ago
data-generation, wakefield
7.13 score 256 stars 209 scripts 739 downloads

qdapTools - Tools for the 'qdap' Package
A collection of tools associated with the 'qdap' package that may be useful outside of the context of text analysis.
Last updated 2 years ago
7.04 score 16 stars 5 dependents 408 scripts 2.2k downloads

numform - Tools to Format Numbers for Publication
Format numbers and plots for publication; includes the removal of leading zeros, standardization of number of digits, addition of affixes, and a p-value formatter. These tools combine the functionality of several 'base' functions such as 'paste()', 'format()', and 'sprintf()' into specific use case functions that are named in a way that is consistent with usage, making their names easy to remember and easy to deploy.
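Two of the formatting tasks named above, dropping leading zeros and formatting p-values, can be sketched in a few lines of Python. This is not the numform API: the function names format_number and format_p_value, their defaults, and the "p < threshold" convention are assumptions for this example.

```python
def format_number(x: float, digits: int = 2) -> str:
    """Round to `digits` decimals and drop the leading zero (0.26 -> .26)."""
    s = f"{x:.{digits}f}"
    if s.startswith("0."):
        return s[1:]
    if s.startswith("-0."):
        return "-" + s[2:]
    return s


def format_p_value(p: float, digits: int = 3) -> str:
    """Report p with no leading zero; below the smallest printable value,
    report an inequality instead of a misleading '.000'."""
    threshold = 10 ** -digits
    if p < threshold:
        return f"p < {format_number(threshold, digits)}"
    return f"p = {format_number(p, digits)}"


print(format_number(0.256))    # .26
print(format_number(-0.5))     # -.50
print(format_p_value(0.0312))  # p = .031
print(format_p_value(0.0004))  # p < .001
```

Wrapping the general-purpose formatter in small, purpose-named functions is the same idea the description attributes to the package: one call per use case instead of a remembered format string.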
Last updated 3 years ago
number-formating
6.06 score 51 stars 1 dependents 151 scripts 800 downloads

qdapDictionaries - Dictionaries and Word Lists for the 'qdap' Package
A collection of text analysis dictionaries and word lists for use with the 'qdap' package.
Last updated 7 years ago
5.99 score 4 stars 6 dependents 113 scripts 2.4k downloads