Create a NER Entity.
Counts the entities from a NER operation.
Creates a NER record from a date and a domain.
Do NER classification on input path, output JSON.
Do NER classification on input path, output JSON.
path of classifier file
path of file with tuples (date: String, url: String, content: String) from which to extract entities
path of output file (e.g., "entities.json")
Spark context object
Merges the counts from two lists of tuples.
Merges the counts from two lists of tuples.
the first list of tuples (String, Count)
the second list of tuples to merge into keyCount1
combined list of (key, Count) tuples.
Combines directory of part-files containing one JSON array per line into a single file containing a single JSON array of arrays.
Combines directory of part-files containing one JSON array per line into a single file containing a single JSON array of arrays.
name of directory holding files, also name that will be given to JSON file
Unit().
Classifies records using NER and stores results as JSON.