Take an existing DataFrame, and add a new column to it.
The name of the column to add
A function which generates the new column's values based on input columns
the first input column
the second input column
the third input column
the fourth input column
the fifth input column
The existing DataFrame
A new DataFrame with the named column added.
Take an existing DataFrame, and add a new column to it.
The name of the column to add
A function which generates the new column's values based on input columns
the first input column
the second input column
the third input column
the fourth input column
The existing DataFrame
A new DataFrame with the named column added.
Take an existing DataFrame, and add a new column to it.
The name of the column to add
A function which generates the new column's values based on input columns
the first input column
the second input column
the third input column
The existing DataFrame
A new DataFrame with the named column added.
Take an existing DataFrame, and add a new column to it.
The name of the column to add
A function which generates the new column's values based on input columns
the first input column
the second input column
The existing DataFrame
A new DataFrame with the named column added.
Take an existing DataFrame, and add a new column to it.
The name of the column to add
A function which generates the new column's values based on input columns
the input column
The existing DataFrame
A new DataFrame with the named column added.
Take an existing DataFrame, and add a new column to it.
The name of the column to add
A function which generates the new column's values based on input columns
The existing DataFrame
A new DataFrame with the named column added.
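The addColumn variants above differ only in how many input columns feed the generator function. As a rough illustration of the idea in plain Spark, the sketch below wraps a two-argument function in a UDF and applies it to two named columns. The name `addColumnSketch`, its curried parameter layout, and the sample data are assumptions made for this example; this is not the library's implementation.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, udf}

object AddColumnSketch {
  // Hypothetical two-input variant: wrap the generator function in a UDF
  // and apply it to the two named input columns.
  def addColumnSketch(
    columnName: String,
    columnFcn: (Int, Int) => Int,
    in1: String,
    in2: String
  )(input: DataFrame): DataFrame = {
    val generator = udf(columnFcn)
    input.withColumn(columnName, generator(col(in1), col(in2)))
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("addColumnSketch")
      .getOrCreate()
    import spark.implicits._

    // Sample data invented for the example.
    val df = Seq((1, 2), (3, 4)).toDF("a", "b")
    val result = addColumnSketch("sum", (x: Int, y: Int) => x + y, "a", "b")(df)
    result.show()

    spark.stop()
  }
}
```

Taking the DataFrame in a second parameter list keeps the configured operation reusable: the first argument list fixes the column names and function, and the result can then be applied to any DataFrame with matching columns.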
cache() the specified DataFrame
the DataFrame to cache()
the input DataFrame, after calling cache()
Cast a set of columns to new types, replacing the original columns
a Map[String, String] of columnName => datatype
The existing DataFrame
A new DataFrame with the cast columns
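One way to picture a map-driven cast is as a fold over the columnName => datatype map, replacing each column with a cast copy of itself. The sketch below uses only standard Spark calls (`withColumn`, `Column.cast`); the name `castColumnsSketch` and the sample data are hypothetical, and the library's own operation may differ.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.col

object CastColumnsSketch {
  // Hypothetical helper: for each (columnName -> datatype) entry, overwrite
  // the column with a cast version. withColumn replaces an existing column
  // of the same name, so the column order is preserved.
  def castColumnsSketch(castMap: Map[String, String])(input: DataFrame): DataFrame =
    castMap.foldLeft(input) { case (df, (name, dataType)) =>
      df.withColumn(name, col(name).cast(dataType))
    }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("castColumnsSketch")
      .getOrCreate()
    import spark.implicits._

    // Sample data invented for the example: both columns start as strings.
    val df = Seq(("1", "2.5"), ("2", "4.0")).toDF("id", "score")
    val result = castColumnsSketch(Map("id" -> "int", "score" -> "double"))(df)
    result.printSchema()

    spark.stop()
  }
}
```

The datatype strings here are the short type names Spark accepts in `Column.cast`, such as "int", "double", or "string".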
Takes a DataFrame and copies a column in it
the column to copy
the column to place the copy in
The existing DataFrame
A new DataFrame with the copied column
Stub object necessary due to https://issues.scala-lang.org/browse/SI-8124
Documentation for ops.core.dataframe can be found at software.uncharted.sparkpipe.ops.core.dataframe
Remove columns from a DataFrame
the named columns to remove
the input DataFrame
the resulting DataFrame, without the specified columns
Input/output operations for DataFrames, based on the sparkSession.read and DataFrame.write APIs
Inner join two data frames on the specified columns.
The join ID column of the first data frame
The join ID column of the second data frame
The first data frame
The second data frame
The joined data frame
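For readers unfamiliar with Spark joins, an inner join on one ID column per side can be sketched with the standard `DataFrame.join` call as shown below. The name `joinSketch`, its parameter layout, and the sample tables are assumptions made for illustration only, not the signature documented above.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object JoinSketch {
  // Hypothetical helper: inner-join two DataFrames where the left ID column
  // equals the right ID column. Both ID columns remain in the result.
  def joinSketch(leftColumn: String, rightColumn: String)(
    left: DataFrame,
    right: DataFrame
  ): DataFrame =
    left.join(right, left(leftColumn) === right(rightColumn), "inner")

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("joinSketch")
      .getOrCreate()
    import spark.implicits._

    // Sample data invented for the example.
    val users = Seq((1, "ann"), (2, "bob")).toDF("id", "name")
    val orders = Seq((1, 9.5), (1, 3.0), (3, 1.0)).toDF("userId", "amount")

    // Only rows whose id matches a userId survive an inner join.
    val joined = joinSketch("id", "userId")(users, orders)
    joined.show()

    spark.stop()
  }
}
```

Because the join is inner, users without orders and orders without a matching user are both dropped from the result.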
Numeric pipeline operations that operate on DataFrames.
Rename columns in a DataFrame
a Map[String, String] from columns in the DataFrame to new names
the input DataFrame
a new DataFrame with the renamed columns
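A map-driven rename can be pictured as repeated calls to Spark's `withColumnRenamed`, one per map entry. The following is a minimal sketch under that assumption; `renameColumnsSketch` and the sample data are hypothetical names introduced for the example, not the library's code.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object RenameColumnsSketch {
  // Hypothetical helper: apply each (oldName -> newName) pair in turn.
  // withColumnRenamed is a no-op when oldName is not present in the schema.
  def renameColumnsSketch(nameMap: Map[String, String])(input: DataFrame): DataFrame =
    nameMap.foldLeft(input) { case (df, (oldName, newName)) =>
      df.withColumnRenamed(oldName, newName)
    }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("renameColumnsSketch")
      .getOrCreate()
    import spark.implicits._

    // Sample data invented for the example.
    val df = Seq((1, "x"), (2, "y")).toDF("a", "b")
    val result = renameColumnsSketch(Map("a" -> "id"))(df)
    result.printSchema()

    spark.stop()
  }
}
```

Note that a sequential fold is order-sensitive when the map swaps names (for example "a" -> "b" together with "b" -> "a"); a production version would need to handle that case explicitly.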
Takes a DataFrame and replaces a column in it using a transformation function
the column to replace
The existing DataFrame
A new DataFrame with the replaced column
Common pipeline operations for dealing with temporal data
Common pipeline operations for dealing with textual data
Convert a DataFrame to an RDD[Row]
the DataFrame
the underlying RDD[Row] from frame
Common operations for manipulating dataframes