官术网_书友最值得收藏!

How it works...

We begin the recipe by importing the Markovify library, a library for Markov chain computations, and reading in text, which will inform our Markov model (step 1). In step 2, we create a Markov chain model using the text. The following is a relevant snippet from the text object's initialization code:

class Text(object):

reject_pat = re.compile(r"(^')|('$)|\s'|'\s|[\"(\(\)\[\])]")

def __init__(self, input_text, state_size=2, chain=None, parsed_sentences=None, retain_original=True, well_formed=True, reject_reg=''):
"""
input_text: A string.
state_size: An integer, indicating the number of words in the model's state.
chain: A trained markovify.Chain instance for this text, if pre-processed.
parsed_sentences: A list of lists, where each outer list is a "run"
of the process (e.g. a single sentence), and each inner list
contains the steps (e.g. words) in the run. If you want to simulate
an infinite process, you can come very close by passing just one, very
long run.
retain_original: Indicates whether to keep the original corpus.
well_formed: Indicates whether sentences should be well-formed, preventing
unmatched quotes, parenthesis by default, or a custom regular expression
can be provided.
reject_reg: If well_formed is True, this can be provided to override the
standard rejection pattern.
"""

The most important parameter to understand is state_size = 2, which means that the Markov chains will be computing transitions between consecutive pairs of words. For more realistic sentences, this parameter can be increased, at the cost of making sentences appear less original. Next, we apply the Markov chains we have trained to generate a few example sentences (steps 3 and 4). We can see clearly that the Markov chains have captured the tone and style of the text. Finally, in step 5, we create a few tweets in the style of the airport reviews using our Markov chains.

主站蜘蛛池模板: 呼和浩特市| 漳浦县| 吴忠市| 永顺县| 崇左市| 琼结县| 腾冲县| 昆明市| 英吉沙县| 修武县| 邻水| 恭城| 玉门市| 大足县| 四会市| 嘉祥县| 蚌埠市| 外汇| 弥勒县| 唐河县| 平陆县| 永善县| 潮州市| 阿鲁科尔沁旗| 花莲市| 合川市| 民县| 镇巴县| 莎车县| 临澧县| 盘山县| 南宁市| 奈曼旗| 天门市| 五家渠市| 金寨县| 利川市| 庄浪县| 油尖旺区| 巢湖市| 梁平县|