Issue2navicrawler

made by the médialab

A Python library that transforms a DMI IssueCrawler XML file into a Navicrawler wxsf (XML) file

Tools – Code

Paul Girard

This (deprecated) Python library converted a web corpus built with the IssueCrawler (from Amsterdam's Digital Methods Initiative) into the format accepted by the Navicrawler.

The two approaches could thus be combined: build a corpus automatically through crawling, then curate the results manually within a customized web browser.

It enabled experiments that later led to the development of Hyphe and the associated Hyphe Browser.

processing

developers

archived

2010