Extract Image details from web archive using Data Frame and Spark SQL.
Extract Image details from web archive using Data Frame and Spark SQL.
Data frame obtained from RecordLoader
Dataset[Row], where the schema is (crawl_date, url, filename, extension, mime_type_server, mime_type_tika, width, height, MD5, SHA1, body)
Extracts image details given raw bytes.