ScraperWiki » scraping

Scrape it – Save it – Get it

April 11, 2011, 11:22 am

I imagine I’m talking to a load of developers. Which is odd seeing as I’m not a developer. In fact, I decided to lose my coding virginity by riding the ScraperWiki digger! I’m a journalist interested...

ScraperWiki: A story about two boys, web scraping and a worm

May 5, 2011, 3:06 am

“It’s like a buddy movie,” she said. Not quite the kind of story lead I’m used to. But what do you expect if you employ journalists in a tech startup? “Tell them about that computer game of his that...

Access government in a way that makes sense to you? Surely not!

May 11, 2011, 8:07 am

alpha.gov.uk uses ScraperWiki, a cutting-edge data-gathering tool, to deliver the results that citizens want. And radically for government, rather than tossing a finished product out onto the web with...

Scraping guides: Excel spreadsheets

September 14, 2011, 8:55 am

Following on from the CSV scraping guide, we’ve now added one about scraping Excel spreadsheets. You can get to them from the documentation page. The Excel scraping guide is available in Ruby, Python...

How to get along with an ASP webpage

November 9, 2011, 4:14 am

Fingal County Council of Ireland recently published a number of sets of Open Data in nice clean CSV, XML and KML formats. Unfortunately, the one set of Open Data that was difficult to obtain was the...

Fine set of graphs at the Office of National Statistics

March 22, 2012, 4:47 am

It’s difficult to keep up. I’ve just noticed a set of interesting interactive graphs over at the Office for National Statistics (UK). If the world is about people, then the most fundamental dataset of...

Local ScraperWiki Library

June 7, 2012, 8:24 am

It quite annoyed me that you can only use the scraperwiki library on a ScraperWiki instance; most of it could work fine elsewhere. So I’ve pulled it out (well, for Python at least) so you can use it...

Software Archaeology and the ScraperWiki Data Challenge at #europython

June 29, 2012, 2:24 am

There’s a term in technical circles called “software archaeology” – it’s when you spend time studying and reverse-engineering badly documented code, to make it work, or make it better. Scraper writing...

Three hundred thousand tonnes of gold

July 4, 2012, 1:17 pm

On 2 July 2012, the US Government debt to the penny was quoted at $15,888,741,858,820.66. So I wrote this scraper to read the daily US government debt for every day back to 1996. Unfortunately such a...

Digging Olympic Data at Londinium MMXII

July 24, 2012, 2:50 am

This is a guest post by Makoto Inoue, one of the organisers of this weekend’s Londinium MMXII hackathon. The Olympics! Only a few days to go until seemingly every news camera on the planet is pointed...

Scraping Spreadsheets with XYPath

March 12, 2014, 10:11 am

Spreadsheets are great. They’re ubiquitously available, beaten only by web pages and word processor documents. Like the word processor, they’re easy to use and give the user a blank page, but...
