html-table-parser-python3

This module consists of just one small class. Its purpose is to parse HTML tables without the help of external modules: everything I use ships with Python 3. Instead of installing this module, you can just copy the class located in parse.py into your own code.

How to use

Usage is probably best shown by example, using pyenv for convenience:

pyenv local
python ./example_of_usage.py

The parser returns a nested list: a list of tables, each a list of rows, each a list of cells as strings. Tags inside cells are stripped and their text content is joined. The console output for parsing all tables on the Twitter home page looks like this:

>>> 
[[['', 'Anmelden']],
 [['Land', 'Code', 'Für Kunden von'],
  ['Vereinigte Staaten', '40404', '(beliebig)'],
  ['Kanada', '21212', '(beliebig)'],
  ...
  ['3424486444', 'Vodafone'],
  ['Zeige SMS-Kurzwahlen für andere Länder']]]
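
To make the nested-list shape concrete, here is a minimal sketch of how a table parser can be built on the standard library's html.parser.HTMLParser alone. The class name SketchTableParser and its tables attribute are hypothetical and chosen for illustration; this is not the class shipped in this module, and it does not try to handle nested tables.

```python
from html.parser import HTMLParser


class SketchTableParser(HTMLParser):
    """Hypothetical stand-in for illustration, not this module's class."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.tables = []   # finished tables: lists of rows of cell strings
        self._row = None   # row being built, or None outside <tr>
        self._cell = None  # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text inside nested tags (<b>, <a>, ...) also arrives here, so
        # ignoring those tags and joining the fragments strips the markup.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None
            self._cell = None  # discard a cell left open by broken markup


parser = SketchTableParser()
parser.feed("<table><tr><th>Land</th><th>Code</th></tr>"
            "<tr><td><b>Kanada</b></td><td>21212</td></tr></table>")
parser.close()
print(parser.tables)  # [[['Land', 'Code'], ['Kanada', '21212']]]
```

The outer list holds one entry per table, each table holds one list per row, and each row holds plain strings, which matches the structure of the output shown above.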

CLI

There is also a command-line interface that you can use directly to generate a CSV file:

./html_table_converter -u http://metal-train.de/index.php/fahrplan.html -o metaltrain

For help with the supported parameters, append -h:

./html_table_converter -h
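
If you would rather produce CSV from your own code, the conversion step can be sketched with the standard csv module. This is only an illustration under assumptions: the helper name table_to_csv is hypothetical, and the sketch says nothing about how html_table_converter names its output files or splits multiple tables.

```python
import csv
import io


def table_to_csv(rows):
    """Serialize one table (a list of rows of cell strings) as CSV text."""
    buffer = io.StringIO()
    # csv.writer quotes cells containing commas or quotes automatically.
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Sample data in the same shape as one parsed table (values are made up).
table = [["Land", "Code"], ["Kanada", "21212"], ["a, b", 'say "hi"']]
csv_text = table_to_csv(table)
print(csv_text)
```

Writing to a file instead only requires passing a file opened with newline="" to csv.writer in place of the in-memory buffer.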

Tests

Sadly there are none. I'd really be interested in a PR, since I don't know the Python ecosystem well.

boazde/html-table-parser-python3
