WebRexp

Query language designed to extract information from a bunch of HTML files. The idea is to be able to extract information from a bunch of files linked between each others, using a syntax similar to regexp.

Using

For tutorial and examples, please see the project's wiki. As a teasing, here is some samples for a tiny command line for listing a RSS stream's title :

webrexp '"http://someurl.com" >> item title [.]'

And another one to dump all the images from a webpage

webrexp '"http://someurl.com" >> img [.]'

Hackage

Haskell users can directly grab releases on Hackage.

Building

To build the webrexp project you must have GHC (Glasgow Haskell Compiler) installed and some cabal package. To know which package your are missing, in the source folder type :

make conf

A list of missing package should be shown, then you can

cabal install packagename

for every missing package and finally type maketo build. Or you can download the binary. Binary are simpler. So fucking simpler.

Name		Name	Last commit message	Last commit date
Latest commit History 139 Commits
Text		Text
Webrexp.wiki @ f45b417		Webrexp.wiki @ f45b417
extras/vim		extras/vim
test		test
.gitignore		.gitignore
.gitmodules		.gitmodules
Makefile		Makefile
README.markdown		README.markdown
Setup.hs		Setup.hs
Webrexp.cabal		Webrexp.cabal
dotify.ps1		dotify.ps1
webrexpMain.hs		webrexpMain.hs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text

Text

Webrexp.wiki @ f45b417

Webrexp.wiki @ f45b417

extras/vim

extras/vim

test

test

.gitignore

.gitignore

.gitmodules

.gitmodules

Makefile

Makefile

README.markdown

README.markdown

Setup.hs

Setup.hs

Webrexp.cabal

Webrexp.cabal

dotify.ps1

dotify.ps1

webrexpMain.hs

webrexpMain.hs

Repository files navigation

WebRexp

Using

Hackage

Building

About

Releases

Packages

Languages

Twinside/Webrexp

Folders and files

Latest commit

History

Repository files navigation

WebRexp

Using

Hackage

Building

About

Resources

Stars

Watchers

Forks

Languages