Scallop option reader constructed with class CmdAppConf
Prepare for invoking extractors.
Prepare for invoking extractors.
Any
Process the handler.
Process the handler.
Any
Routine for saving Dataset obtained from querying DataFrames to CSV.
Routine for saving Dataset obtained from querying DataFrames to CSV. Files may be merged according to options specified in 'partition' setting.
generic dataset obtained from querying DataFrame
Unit
Routine for saving Dataset obtained from querying DataFrames to Parquet.
Routine for saving Dataset obtained from querying DataFrames to Parquet. Files may be merged according to options specified in 'partition' setting.
generic dataset obtained from querying DataFrame
Unit
Set the app name.
Set the app name.
String
Set Spark context to be used.
Set Spark context to be used.
either a brand new or existing Spark context
Verify the validity of command line arguments regarding input and output files.
Verify the validity of command line arguments regarding input and output files.
All input files need to exist, and ouput files should not exist, for this to pass. Throws exception if condition is not met.
Unit
IllegalArgumentException
exception thrown
Main application that parse command line arguments and invoke appropriate extractor.