enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Pdf-parser - Wikipedia

    en.wikipedia.org/wiki/Pdf-parser

    Pdf-parser is a command-line program that parses and analyses PDF documents. It provides features to extract raw data from PDF documents, like compressed images. pdf-parser can deal with malicious PDF documents that use obfuscation features of the PDF language. [1] The tool can also be used to extract data from damaged or corrupt PDF documents.

  3. PLY (software) - Wikipedia

    en.wikipedia.org/wiki/PLY_(software)

    PLY is a parsing tool written purely in Python. It is, in essence, a re-implementation of Lex and Yacc originally in C-language . It was written by David M. Beazley .

  4. List of PDF software - Wikipedia

    en.wikipedia.org/wiki/List_of_PDF_software

    PDF Tools allows creation of PDFs from many types of source input (images, scans, etc.). The PDF-XChange print driver allows printing directly to a PDF. A "lite" version of the print driver is free for non-commercial (home and academic) use. PrimoPDF: Proprietary: Yes: Virtual printer, for Microsoft .NET Framework and uses Ghostscript and RedMon.

  5. Comparison of parser generators - Wikipedia

    en.wikipedia.org/wiki/Comparison_of_parser...

    However, parser generators for context-free grammars often support the ability for user-written code to introduce limited amounts of context-sensitivity. (For example, upon encountering a variable declaration, user-written code could save the name and type of the variable into an external data structure, so that these could be checked against ...

  6. Natural Language Toolkit - Wikipedia

    en.wikipedia.org/wiki/Natural_Language_Toolkit

    Parse tree generated with NLTK. The Natural Language Toolkit, or more commonly NLTK, is a suite of libraries and programs for symbolic and statistical natural language processing (NLP) for English written in the Python programming language. It supports classification, tokenization, stemming, tagging, parsing, and semantic reasoning ...

  7. Beautiful Soup (HTML parser) - Wikipedia

    en.wikipedia.org/wiki/Beautiful_Soup_(HTML_parser)

    Beautiful Soup is a Python package for parsing HTML and XML documents, including those with malformed markup. It creates a parse tree for documents that can be used to extract data from HTML, [3] which is useful for web scraping. [2] [4]

  8. Packrat parser - Wikipedia

    en.wikipedia.org/wiki/Packrat_parser

    The Packrat parser is a type of parser that shares similarities with the recursive descent parser in its construction. However, it differs because it takes parsing expression grammars (PEGs) as input rather than LL grammars .

  9. Flex (lexical analyser generator) - Wikipedia

    en.wikipedia.org/wiki/Flex_(lexical_analyser...

    Flex (fast lexical analyzer generator) is a free and open-source software alternative to lex. [2] It is a computer program that generates lexical analyzers (also known as "scanners" or "lexers").