Description
- Using Text String Within Formula In Excel
- Using Textstat Keyboard
- Using Textstat To Find
- Using Textstat To Use
- Using Textstat To Draw
- Using Textstat To Make
Apr 25, 2016 TextSTAT 2.9c for Windows (ZIP file, approx. 8 MB, Feb 20, 2014). This version includes everything you need to use TextSTAT with Windows. It comes as a single installation file. To install the programme, just unpack the file to a directory of your choice. To run TextSTAT, change to that directory and double-click 'TextSTAT.exe'.
Nov 22, 2020 Textstat. Textstat is an easy-to-use library for calculating statistics from text. It helps determine readability, complexity, and grade level.
If you want to compare the frequency of a single term across different texts, you can also use textstat_frequency, group the frequency by speech, and extract the term.
Produces counts and document frequency summaries of the features in a dfm, optionally grouped by a docvars variable or other supplied grouping variable.
Usage
This is the default setting for TextSTAT. A different encoding can also be used. This, however, has to be set up prior to the file being read in (via the menu entry: 'Options File encoding'). TextSTAT processes the texts internally in Unicode format. HTML files can be read in directly from the Internet or from your own hard disk.
Arguments
a dfm object
(optional) integer specifying the top n features to be returned, within group if groups is specified
either: a character vector containing the names of document variables to be used for grouping; or a factor or object that can be coerced into a factor equal in length or rows to the number of documents. NA values of the grouping value are dropped. See groups for details.
character string specifying how ties are treated. See data.table::frank() for details. Unlike that function, however, the default is 'min', so that tied features share the smallest rank in the tie: frequencies of 11, 11, 10 would be ranked 1, 1, 3.
additional arguments passed to dfm_group(). This can be useful in passing force = TRUE, for instance, if you are grouping a dfm that has been weighted.
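The 'min' tie rule can be illustrated with a minimal Python sketch. This is not the library's implementation. The helper name rank_min is hypothetical, and the sketch assumes ranks are assigned in descending order of frequency, so that rank 1 goes to the greatest frequency.

```python
def rank_min(frequencies):
    """Rank values in descending order; tied values share the smallest rank.

    Hypothetical helper for illustration only.
    """
    ordered = sorted(frequencies, reverse=True)
    # The 1-based position of a value's first occurrence in the
    # descending list is the smallest rank among its ties.
    first_position = {}
    for position, value in enumerate(ordered, start=1):
        first_position.setdefault(value, position)
    return [first_position[value] for value in frequencies]


print(rank_min([11, 11, 10]))   # [1, 1, 3]
print(rank_min([5, 3, 3, 1]))   # [1, 2, 2, 4]
```

After a tie, the next rank is skipped: two features tied at rank 1 are followed by rank 3, not rank 2.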
Value
a data.frame containing the following variables:
feature
(character) the feature
frequency
count of the feature
rank
rank of the feature, where 1 indicates the greatest frequency
docfreq
document frequency of the feature, as a count (the number of documents in which this feature occurred at least once)
group
(only if groups is specified) the label of the group. If the features have been grouped, then all counts, ranks, and document frequencies are within group. If groups is not specified, the group column is omitted from the returned data.frame.
textstat_frequency returns a data.frame of features and their term and document frequencies within groups.
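To make the shape of the returned table concrete, here is a rough Python sketch of the same bookkeeping over pre-tokenized documents. It is an illustration under stated assumptions, not the function itself. The name frequency_table, the list-of-token-lists input, the use of None as the missing group value, and the sample tokens are all hypothetical.

```python
from collections import Counter, defaultdict


def frequency_table(documents, groups=None, n=None):
    """Build rows of feature, frequency, rank and docfreq (plus group if given).

    documents: list of token lists; groups: optional list with one label per
    document. Hypothetical helper for illustration only.
    """
    labels = groups if groups is not None else ["all"] * len(documents)
    counts = defaultdict(Counter)    # total occurrences per group
    docfreq = defaultdict(Counter)   # number of documents containing the feature
    for tokens, label in zip(documents, labels):
        if label is None:            # missing group values are dropped
            continue
        counts[label].update(tokens)
        docfreq[label].update(set(tokens))

    rows = []
    for label, counter in counts.items():
        # Descending frequency; alphabetical order breaks display ties.
        ordered = sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))
        first_rank = {}
        for position, (_, freq) in enumerate(ordered, start=1):
            first_rank.setdefault(freq, position)   # 'min' tie rule
        for feature, freq in (ordered if n is None else ordered[:n]):
            row = {"feature": feature, "frequency": freq,
                   "rank": first_rank[freq], "docfreq": docfreq[label][feature]}
            if groups is not None:   # group column only when grouping
                row["group"] = label
            rows.append(row)
    return rows


docs = [["tax", "jobs", "tax"], ["jobs", "health"], ["tax", "health", "health"]]
grouped = frequency_table(docs, groups=["2019", "2019", "2020"])
# Compare a single term across groups by filtering the rows.
print([(r["group"], r["frequency"]) for r in grouped if r["feature"] == "tax"])
# [('2019', 2), ('2020', 1)]
```

Counts, ranks, and document frequencies are computed separately for each group, and the group key is absent from every row when no grouping is supplied.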