PyKHTML site-scraping library

Developers Apps

Source (link to git-repo or to original if based on someone elses unmodified work): Add the source-code for this project on opencode.net

0
Score 50.0%
Description:

From the PyKHTML website at http://paul.giannaros.org/pykhtml:

"PyKHTML is...
A Python module for writing website scrapers/spiders. Whereas traditional methods focus on writing the code to parse HTML/forms themselves, PyKHTML uses the excellent KHTML engine to do all the trudge work. It therefore handles webpages very well (even the severely crufty ones) and is pretty darn fast (implemented in C++). As a bonus the module handles JavaScript and cookies transparently. Hurrah!"

Gogast

12 years ago

Is it possible to create thumbnails of web pages?

Report

C

cerulean

12 years ago

Not at the moment, but that should be easily accomplished. Is it something you'd find useful? If so, I could have a go at implementing it.

Report

C

cerulean

12 years ago

More or less implemented in the development repository (though with a requirement that you're in GUI debug mode, for the moment). You can get instructions on how to check it out at http://paul.giannaros.org/pykhtml/download.htm

Report

Gogast

12 years ago

Yes

What do you think about PyKDE ?
Are all KDE libraries already ported to Python ?

Report

C

cerulean

12 years ago

PyKDE is excellent. It wraps pretty much all of the functionality of kdelibs and is a pleasure to work with -- I would highly reocmmend it.
PyKDE4 is not ready yet, it will be released when the KDE4 API is stable and finalised.

Report

12345678910
product-maker Count: 4 Rating: 5.0
File (click to download) Version Description PackagetypeArchitectureRelease Channel Downloads Date Filesize DL OCS-Install MD5SUM
*Needs ocs-url or ocs-store to install things
Pling
Details
license
version
0.2
updated Apr 29 2007
added Apr 29 2007
downloads 24h
0
page views 24h 4
System Tags app software