Show HN: Roma Data Pipeline – Open-Source Ancient Rome Data

smurfysmurf Tuesday, January 06, 2026

I created a data pipeline that aggregates data from Pleiades, Stanford's ORBIS, Harvard's DARMC, AWMC, ToposText, and Wikidata, then normalizes it all into a single schema and builds a SQLite database.
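
To give a rough idea of the aggregate, normalize, and build steps, here is a minimal sketch in Python. It is not the code from the repo: the sample records, field names, the `places` table, and every function name below are assumptions made up for illustration, and the real source exports and schema will likely differ.

```python
import sqlite3

# Made-up sample records. The field names are illustrative only and are
# NOT the real export formats of Pleiades or Wikidata.
SAMPLE_PLEIADES = [
    {"id": "example-1", "title": "Roma", "lat": 41.89, "lon": 12.49},
]
SAMPLE_WIKIDATA = [
    {"qid": "example-2", "label": "Rome", "coord": (41.89, 12.48)},
]


def normalize_pleiades(row):
    """Map a (hypothetical) Pleiades-style record onto the shared schema."""
    return {
        "source": "pleiades",
        "source_id": row["id"],
        "name": row["title"],
        "lat": row["lat"],
        "lon": row["lon"],
    }


def normalize_wikidata(row):
    """Map a (hypothetical) Wikidata-style record onto the shared schema."""
    lat, lon = row["coord"]
    return {
        "source": "wikidata",
        "source_id": row["qid"],
        "name": row["label"],
        "lat": lat,
        "lon": lon,
    }


def build_database(conn, places):
    """Create the single normalized table and load every record into it."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS places (
            source    TEXT NOT NULL,
            source_id TEXT NOT NULL,
            name      TEXT NOT NULL,
            lat       REAL,
            lon       REAL,
            PRIMARY KEY (source, source_id)
        )
        """
    )
    # Named placeholders let each normalized dict be inserted directly.
    conn.executemany(
        "INSERT OR REPLACE INTO places "
        "VALUES (:source, :source_id, :name, :lat, :lon)",
        places,
    )
    conn.commit()


def run_pipeline(conn):
    """Aggregate all sources, normalize them, and build the SQLite table."""
    places = [normalize_pleiades(r) for r in SAMPLE_PLEIADES]
    places += [normalize_wikidata(r) for r in SAMPLE_WIKIDATA]
    build_database(conn, places)
    return len(places)


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(run_pipeline(connection), "places loaded")
```

Keeping one small normalizer per source means adding a new dataset only requires writing one more mapping function, and the composite key on source and source ID keeps reruns from duplicating rows.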

I did this in support of a free app I developed that gives a Google/Apple Maps-like experience for ancient Rome.

I had a difficult time finding good, joined data sources, so I created my own. I thought I would open-source it in the hope that 1) others may find some value in it (especially academics), and 2) others can contribute and grow the breadth and depth of the data.

Open to all feedback and thoughts, and also to any pointers to relevant datasets or similar projects that might exist.

GitHub: https://github.com/thomaspalaio/roma-data-pipeline (P.S. This is my first time doing a side project / open source.)

Summary
This post introduces Roma Data Pipeline, an open-source data pipeline that aggregates ancient Rome data from Pleiades, ORBIS, DARMC, AWMC, ToposText, and Wikidata, normalizes it into a single schema, and builds a SQLite database. The author built it to support a free map app for ancient Rome and is asking for feedback, contributions, and pointers to related datasets or projects.