io.archivesunleashed.spark.matchbox
UDF for extracting image links from a webpage given the HTML content (using Jsoup).
the src link
the content from which links are to be extracted Returns a sequence of image links
UDF for extracting image links from a webpage given the HTML content (using Jsoup).