官术网_书友最值得收藏!

Text parsers

Parsing is something that web apps need to do quite often. Opa features a built-in syntax for building text parsers, which are first class values just as functions. The parser is based on parsing expression grammar (http://en.wikipedia.org/wiki/Parsing_expression_grammar), which may look like regular expressions at first, but do not behave anything like them. One big advantage of text parsers over regular expressions is that you can easily combine parsers. A good example is parsing URLs. Let's start right away with our first Opa parser:

first_parser = parser { 
case "Opa"  : 1 
}

For first_parser, the expressions are just literal strings, which means this parser will succeed only if fed with the string "Opa". Then how to use this parser? The module Parser (http://doc.opalang.org/module/stdlib.core.parser/Parser) has a bunch of functions to deal with parsers. The most important one is:

Parser.try_parse : Parser.general_parser('a), string -> option('a)

It takes a parser and a string as parameters and produces an optional value of some type. Let's see how to use this function:

x = Parser.try_parse(parser1,"Opa")  //x = {some: 1}
y = Parser.try_parse(parser1,"Java") //y = {none}

Now let's consider the following parsers:

digit1 = parser { case x=[0-9]+: x }
digit2 = parser { case x=([0-9]+): x }

Both digit1 and digit2 accept a number string like "5","100", and both will assign the value to the identifier x. If we feed the parser digit1 with the string "100", x will be the parsing result of the string: a list of characters ['1','0','0']. If we feed the string "100" to parser digit2, x will be the input string: 100. So, if we want to get hold of the input string, we need to put the expression in parentheses.

Let's move it a little further; consider the following parser:

abs_parser = parser{
  case x=("+"?[0-9]+): Int.of_string("{x}")
  case x=("-"[0-9]+) : 0 – Int.of_string("{x}")
}
x = Parser.try_parse(abs_parser,"-100") // x = {some: 100}

This parser accepts an integer string and returns the absolute value. You may figure out how it works with the previous knowledge. Note that even if the expression of PEG looks like a regular expression, they are different.

主站蜘蛛池模板: 防城港市| 宜都市| 玛纳斯县| 敦煌市| 历史| 阿图什市| 澄城县| 长宁县| 通海县| 屯昌县| 桑植县| 驻马店市| 博兴县| 海伦市| 融水| 壤塘县| 绍兴市| 富锦市| 富锦市| 万源市| 孝义市| 罗城| 上高县| 格尔木市| 清流县| 弥渡县| 泰州市| 广州市| 甘南县| 卫辉市| 镇安县| 阿拉善右旗| 平遥县| 加查县| 沙田区| 满洲里市| 昌图县| 聊城市| 邻水| 曲靖市| 伊宁县|