Performs NER classification on the records at the input path and writes the results as JSON.
path of classifier file
path of file with tuples (date: String, url: String, content: String) from which to extract entities
path of output file (e.g., "entities.json")
Spark context object
Combines a directory of part-files, each containing one JSON array per line, into a single file containing one JSON array of arrays.
name of the directory holding the part-files; the combined JSON file is given this same name
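
The combining step can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the toolkit's actual implementation. The names `CombineSketch`, `combinedJson`, and `combine` are hypothetical. It assumes the part-files follow the usual `part-` naming prefix and that each non-empty line is already a valid JSON array. Replacing the directory with a file of the same name is one reading of the parameter description above.

```scala
import java.io.File
import java.nio.charset.StandardCharsets
import java.nio.file.Files
import scala.io.Source

// Hypothetical helper object; all names here are illustrative only.
object CombineSketch {

  /** Joins every non-empty line of the part-* files in `dir` into one JSON array of arrays. */
  def combinedJson(dir: File): String = {
    // Assumption: only files named part-* hold data (skips _SUCCESS and .crc files).
    val parts = Option(dir.listFiles()).getOrElse(Array.empty[File])
      .filter(f => f.isFile && f.getName.startsWith("part-"))
      .sortBy(_.getName)
    val lines = parts.toList.flatMap { f =>
      val src = Source.fromFile(f, "UTF-8")
      try src.getLines().map(_.trim).filter(_.nonEmpty).toList
      finally src.close()
    }
    // Each line is itself a JSON array, so wrapping them yields an array of arrays.
    lines.mkString("[", ",\n", "]")
  }

  private def deleteRecursively(f: File): Unit = {
    Option(f.listFiles()).foreach(_.foreach(deleteRecursively))
    if (!f.delete()) sys.error(s"could not delete $f")
  }

  /** Replaces the directory `dirName` with a single JSON file of the same name. */
  def combine(dirName: String): Unit = {
    val dir = new File(dirName)
    val tmp = new File(dirName + ".tmp")
    // Write to a sibling file first so the part-files survive a failed write.
    Files.write(tmp.toPath, combinedJson(dir).getBytes(StandardCharsets.UTF_8))
    deleteRecursively(dir)
    if (!tmp.renameTo(dir)) sys.error(s"could not rename $tmp to $dir")
  }
}
```

Calling `CombineSketch.combine("entities.json")` on a directory of that name would leave a single file `entities.json` in its place.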
Classifies records using NER and stores results as JSON