Methodology

Data Collection

Every 6 hours, our automated pipeline queries Google News RSS feeds for articles from major Irish news publications that contain keywords related to road fatalities in Ireland.
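
As a rough illustration of this step, the sketch below builds a Google News RSS search URL and reads the items from a feed document. The function names, the keyword phrasing, and the locale parameters (hl, gl, ceid) are assumptions made for this example, not the pipeline's actual configuration.

```python
import urllib.parse
import xml.etree.ElementTree as ET

GOOGLE_NEWS_SEARCH = "https://news.google.com/rss/search"


def build_feed_url(keywords, lang="en-IE", country="IE"):
    """Build a Google News RSS search URL matching any of the keyword phrases.

    The locale defaults are assumptions for an Irish, English-language feed.
    """
    query = " OR ".join(f'"{phrase}"' for phrase in keywords)
    params = {"q": query, "hl": lang, "gl": country, "ceid": f"{country}:en"}
    return f"{GOOGLE_NEWS_SEARCH}?{urllib.parse.urlencode(params)}"


def parse_items(rss_xml):
    """Return title, link and publication date for each <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    return [
        {
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "published": item.findtext("pubDate", default=""),
        }
        for item in root.iter("item")
    ]
```

Keeping URL construction separate from parsing means the parser can be exercised on a saved feed without making a network request.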

Data Extraction

Each article is passed to an AI model (Claude), which extracts the following structured information:

  • Date of incident
  • Location (county, town, road)
  • Number of fatalities
  • Vehicle types involved
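
One way to represent the fields listed above is a small record type that validates the model's output before it is stored. The sketch below assumes the model is asked to reply with a JSON object; the key names, the Incident class, and parse_incident are hypothetical and chosen only for this example.

```python
import json
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional


@dataclass
class Incident:
    incident_date: date
    county: str
    fatalities: int
    town: Optional[str] = None
    road: Optional[str] = None
    vehicle_types: List[str] = field(default_factory=list)


def parse_incident(raw_json):
    """Validate a JSON reply (assumed key names) and convert it to an Incident."""
    data = json.loads(raw_json)
    fatalities = int(data["fatalities"])
    if fatalities < 0:
        raise ValueError("fatality count cannot be negative")
    return Incident(
        incident_date=date.fromisoformat(data["date"]),  # expects YYYY-MM-DD
        county=data["county"].strip().title(),
        fatalities=fatalities,
        town=data.get("town"),
        road=data.get("road"),
        vehicle_types=list(data.get("vehicle_types", [])),
    )
```

Parsing into typed fields at this point means a malformed date or a missing county raises an error immediately instead of surfacing later during geocoding or deduplication.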

Geocoding

Locations are geocoded using OpenStreetMap's Nominatim service to place incidents on the map. When an exact location cannot be determined, we fall back to the town centroid or, failing that, the county centroid.
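
The fallback can be sketched as a list of queries tried from most to least specific. The example below calls Nominatim's public search endpoint; the query wording, the function names, and the User-Agent string are placeholders assumed for illustration, and it treats whatever point Nominatim returns for a town or county as that place's centroid. Nominatim's usage policy asks for an identifying User-Agent and no more than one request per second, so a real pipeline would also need rate limiting.

```python
import json
import urllib.parse
import urllib.request

NOMINATIM_SEARCH = "https://nominatim.openstreetmap.org/search"


def nominatim_lookup(query):
    """Query Nominatim's search endpoint; return (lat, lon) or None if no match."""
    params = urllib.parse.urlencode({"q": query, "format": "jsonv2", "limit": 1})
    request = urllib.request.Request(
        f"{NOMINATIM_SEARCH}?{params}",
        headers={"User-Agent": "road-fatalities-example/0.1"},  # placeholder value
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        results = json.load(response)
    if not results:
        return None
    return float(results[0]["lat"]), float(results[0]["lon"])


def geocode_with_fallback(county, town=None, road=None, lookup=nominatim_lookup):
    """Try road, then town, then county; return (coords, precision) or (None, None)."""
    candidates = []
    if road and town:
        candidates.append(("road", f"{road}, {town}, County {county}, Ireland"))
    if town:
        candidates.append(("town", f"{town}, County {county}, Ireland"))
    candidates.append(("county", f"County {county}, Ireland"))
    for precision, query in candidates:
        coords = lookup(query)
        if coords is not None:
            return coords, precision
    return None, None
```

Returning the precision level alongside the coordinates lets the map distinguish an exact road match from a town or county centroid.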

Deduplication

Multiple news articles often cover the same incident. To avoid double-counting, articles that report the same date, county, and fatality count are treated as a single incident.
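
A minimal sketch of this rule, assuming each extracted record is a dictionary with date, county, fatalities, and url keys (names chosen for this example): records are grouped under a composite key, and the source links of duplicates are kept on the first record seen.

```python
def dedup_key(incident):
    """Composite key: date, normalised county name, fatality count."""
    return (
        incident["date"],
        incident["county"].strip().lower(),
        incident["fatalities"],
    )


def deduplicate(incidents):
    """Collapse records sharing a key, collecting every article URL as a source."""
    merged = {}
    for incident in incidents:
        key = dedup_key(incident)
        if key not in merged:
            merged[key] = {**incident, "sources": [incident["url"]]}
        else:
            merged[key]["sources"].append(incident["url"])
    return list(merged.values())
```

A consequence of keying on these three fields is that two separate incidents on the same day, in the same county, with the same number of fatalities would be merged into one.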

Limitations

  • Not all fatalities are reported in news media
  • There may be a delay between an incident and news coverage
  • Location accuracy varies based on news report detail
  • This is not official data; refer to the Road Safety Authority (RSA) for official statistics

Updates

Data is collected automatically every 6 hours. The site rebuilds and deploys whenever new data is available.