Title: Chinese Character Frequency in Real World
Version: 0.1
Description: It contains Chinese character frequency data based on news data from 2017 to 2019. Source of these news include Sina, China daily and Tencent.
Depends: R (≥ 3.3.2)
License: CC0
Encoding: UTF-8
LazyData: true
RoxygenNote: 7.0.1
NeedsCompilation: no
Packaged: 2019-12-13 01:35:34 UTC; claret
Author: Yitao Ma [aut, cre], Xinjin Zhang [aut], Bin Cui [aut]
Maintainer: Yitao Ma <claret0127@gmail.com>
Repository: CRAN
Date/Publication: 2019-12-18 12:20:02 UTC

Chinese Word Frequency with five Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

fiveChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

 fiveChar

Chinese Word Frequency with four Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

fourChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

 fourChar

Chinese Word Frequency with one Character

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

oneChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

 oneChar

Chinese Word Frequency with three Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

threeChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

 threeChar

Chinese Word Frequency with two Characters

Description

This dataframe inclues the usage frequecy and occurence of Chinese word

Usage

twoChar

Format

A data frame with 3 variables:

character

target character

freq

occurence of target character

pct

frequency of target character

Examples

 twoChar

Occurrence and frequency of the next word

Description

This is a function that lists the next word appearing frequency by a given chinese character based on real world chinese character frequency Statistics table.

Usage

wordMiner.next(candidateWord, topN = 10)

Arguments

candidateWord

A single Chinese character

topN

The number of raws of output dataframe

Value

A dataframe containing the next word and its occurrence and frequency

Examples

## Not run: 
wordMiner.next(strsplit(levels(twoChar$character)[988], "")[[1]][1])
wordMiner.next(strsplit(levels(twoChar$character)[988], "")[[1]][2], 20)

## End(Not run)

Occurrence and frequency of the previous word

Description

This is a function that lists the previous word appearing frequency by a given chinese character based on real world chinese character frequency Statistics table.

Usage

wordMiner.previous(candidateWord, topN = 10)

Arguments

candidateWord

A single Chinese character

topN

The number of raws of output dataframe

Value

A dataframe containing the previous word and its occurrence and frequency

Examples

## Not run: 
wordMiner.previous(strsplit(levels(twoChar$character)[988], "")[[1]][1])
wordMiner.previous(strsplit(levels(twoChar$character)[988], "")[[1]][2], 20)

## End(Not run)