io.archivesunleashed.app
Extract web graph from web archive using DataFrame and Spark SQL.
DataFrame obtained from RecordLoader
Dataset[Row], where the schema is (crawl date, src, image url, alt text)