Make a quick HTML data extractor with REBOL (video)
July 19th, 2008
I will soon need to make some sort of web spider, and it seems to me that REBOL is a very good choice for making one. I programmed in REBOL years ago, so I have forgotten most of it. REBOL recently released the long-awaited version 3 (I was a little afraid it was more of a “vaporware”), and after a few years it showed that Carl Sassenrath still “means business” with it. So my interest in this interesting language is renewed too. Another big plus is that 3.0 largely solves some issues I had with the original REBOL (general closedness, artificial limits in the Core version, async stuff). I decided to make a quick video of the procedure so that I get you interested in this excellent language too. Yes, you. Who did you think I was talking about?
I do the following in the video: I load a webpage of the “last 100 motorbike ads” through HTTP, parse away the unnecessary HTML, then parse the table that holds the ads into separate rows, and finally parse each row into concrete data.
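For readers who don't know REBOL yet, here is a rough sketch of the same three steps (cut away the surrounding HTML, split the table into rows, split each row into data) in Python with only the standard library. This is not the code from the video. The sample page, the `extract_ads` name, and the regular expressions are all assumptions made for illustration, and regular expressions like these are only good enough for simple, well-formed tables.

```python
import re
from html import unescape

# Hypothetical sample page; the real ads page from the video is not reproduced here.
SAMPLE_PAGE = """
<html><body><h1>Last 100 motorbike ads</h1>
<table>
<tr><td>Honda CBR 600</td><td>2004</td><td>3,500 EUR</td></tr>
<tr><td><a href="/ad/2">Yamaha XT 660</a></td><td>2006</td><td>4,200 EUR</td></tr>
</table>
<p>footer</p></body></html>
"""

# Step 1: everything outside the first <table>...</table> is thrown away.
TABLE_RE = re.compile(r"<table\b[^>]*>(.*?)</table>", re.I | re.S)
# Step 2: the table body is split into rows.
ROW_RE = re.compile(r"<tr\b[^>]*>(.*?)</tr>", re.I | re.S)
# Step 3: each row is split into cells, and leftover inner tags are stripped.
CELL_RE = re.compile(r"<t[dh]\b[^>]*>(.*?)</t[dh]>", re.I | re.S)
TAG_RE = re.compile(r"<[^>]+>")


def extract_ads(page):
    """Return the first table on the page as a list of rows of cell text."""
    table = TABLE_RE.search(page)
    if table is None:
        return []
    rows = []
    for row_html in ROW_RE.findall(table.group(1)):
        cells = [unescape(TAG_RE.sub("", cell)).strip()
                 for cell in CELL_RE.findall(row_html)]
        if cells:
            rows.append(cells)
    return rows


if __name__ == "__main__":
    for ad in extract_ads(SAMPLE_PAGE):
        print(ad)
```

To run it against a live page, you could fetch the HTML with `urllib.request.urlopen` and pass the decoded text to `extract_ads`.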
Interesting tidbits: notice that I don’t put quotes around the HTTP address (REBOL has many datatypes, and URL is one of them) or around the XML tags (for the same reason). Most of the time I just use the parse word, which moves me into the parse DIALECT. Most languages have functions and classes, but REBOL also has dialects. The parse dialect is just one of them, and you can create your own dialects just as you can create your own functions.
I used only the most basic features of the parse dialect here. It’s an unbelievably powerful feature.

Just to inform you, there is a new project going on here. It’s a project where no programming is needed (at least for now). It’s a card battle game, but not the trading-card battle kind. I tried to make it easy to learn and hard to master. We taught my wife yesterday in a few minutes, and the next thing we knew she beat us because we weren’t careful. So I think the rules are still very minimal while allowing enough combinatorial complexity to keep the thing interesting. My sister is drawing the card images now, and once that is done we will start making it more public. I made a very simple website where I will post updates about the game.







