How it works...

We begin the recipe by importing Markovify, a library for Markov chain computations, and reading in the text that will inform our Markov model (step 1). In step 2, we create a Markov chain model from the text. The following is a relevant snippet from the initialization code of the library's Text class:

class Text(object):

    reject_pat = re.compile(r"(^')|('$)|\s'|'\s|[\"(\(\)\[\])]")

    def __init__(self, input_text, state_size=2, chain=None, parsed_sentences=None, retain_original=True, well_formed=True, reject_reg=''):
        """
        input_text: A string.
        state_size: An integer, indicating the number of words in the model's state.
        chain: A trained markovify.Chain instance for this text, if pre-processed.
        parsed_sentences: A list of lists, where each outer list is a "run"
            of the process (e.g. a single sentence), and each inner list
            contains the steps (e.g. words) in the run. If you want to simulate
            an infinite process, you can come very close by passing just one, very
            long run.
        retain_original: Indicates whether to keep the original corpus.
        well_formed: Indicates whether sentences should be well-formed, preventing
            unmatched quotes, parenthesis by default, or a custom regular expression
            can be provided.
        reject_reg: If well_formed is True, this can be provided to override the
            standard rejection pattern.
        """
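
To get a feel for what the well_formed option filters out, the reject_pat expression from the snippet can be tried on its own. The following is only a sketch: the looks_well_formed helper and the sample strings are hypothetical, and it assumes the pattern is applied with a plain search over each candidate sentence, which may differ in detail from what the library does internally.

```python
import re

# Same expression as reject_pat in the snippet above.
REJECT_PAT = re.compile(r"(^')|('$)|\s'|'\s|[\"(\(\)\[\])]")


def looks_well_formed(sentence):
    """Hypothetical helper: True if the sentence is non-empty and the
    rejection pattern finds nothing in it."""
    if not sentence.strip():
        return False
    return REJECT_PAT.search(sentence) is None


# Made-up sample sentences, not taken from the recipe's corpus.
samples = [
    "It's a nice airport.",        # apostrophe inside a word: kept
    "The staff said (rudely",      # stray parenthesis: rejected
    "'Quoted at the start",        # leading single quote: rejected
    'He said "no" and left',       # double quotes: rejected
]

for s in samples:
    print(looks_well_formed(s), "|", s)
```

An apostrophe inside a word such as "It's" passes, while quotes and brackets that could end up unmatched in generated text cause the sentence to be skipped.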

The most important parameter to understand is state_size, which defaults to 2. With this value, each state of the Markov chain consists of two consecutive words, and the model learns which words tend to follow each such pair. Increasing this parameter produces more realistic sentences, at the cost of originality: longer states occur in fewer places in the corpus, so the output tends to reproduce the source text. Next, we apply the trained Markov chain to generate a few example sentences (steps 3 and 4). The output shows that the chain has captured the tone and style of the text. Finally, in step 5, we use the same chain to create a few tweets in the style of the airport reviews.
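
To make the role of state_size concrete, here is a minimal, standard-library-only sketch of a word-level Markov chain. It is not Markovify's implementation: build_model, generate, the BEGIN and END markers, and the three-sentence corpus are all illustrative assumptions. It only shows how a state of state_size words maps to the words observed after it.

```python
import random
from collections import defaultdict

# Hypothetical sentinel markers for the start and end of a sentence.
BEGIN = "<BEGIN>"
END = "<END>"


def build_model(sentences, state_size=2):
    """Map each tuple of `state_size` consecutive words to the list of
    words observed immediately after it."""
    model = defaultdict(list)
    for sentence in sentences:
        words = sentence.split()
        items = [BEGIN] * state_size + words + [END]
        for i in range(len(words) + 1):
            state = tuple(items[i:i + state_size])
            model[state].append(items[i + state_size])
    return model


def generate(model, state_size=2, rng=None, max_words=50):
    """Walk the chain from the start state until END or max_words."""
    rng = rng or random.Random()
    state = (BEGIN,) * state_size
    out = []
    while len(out) < max_words:
        nxt = rng.choice(model[state])
        if nxt == END:
            break
        out.append(nxt)
        state = state[1:] + (nxt,)
    return " ".join(out)


# Made-up toy corpus standing in for the review text.
corpus = [
    "the gate was crowded and loud",
    "the gate was clean and quiet",
    "the lounge was clean and bright",
]

model = build_model(corpus, state_size=2)
print(model[("the", "gate")])    # followers seen after "the gate"
print(model[("clean", "and")])   # followers seen after "clean and"
print(generate(model, state_size=2, rng=random.Random(0)))
```

With state_size=2, the pair "clean and" can be followed by either "quiet" or "bright", so the chain can recombine the second and third sentences. Rebuilding the model with state_size=1 lets any word after "and" follow, which mixes the sentences more freely, while larger values leave fewer choices at each step.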
