Package

software.uncharted.sparkpipe.ops.core.dataframe

io

Permalink

package io

Input/output operations for DataFrames, based on the sparkSession.read and DataFrame.write APIs

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. io
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. object docs

    Permalink

    Stub object necessary due to https://issues.scala-lang.org/browse/SI-8124

    Stub object necessary due to https://issues.scala-lang.org/browse/SI-8124

    Documentation for ops.core.dataframe.io can be found at software.uncharted.sparkpipe.ops.core.dataframe.io

    Attributes
    protected[this]
    See also

    software.uncharted.sparkpipe.ops.core.dataframe.io

  2. def read(path: String, format: String = "parquet", options: Map[String, String] = Map[String, String](), schema: StructType = ...)(sparkSession: SparkSession): DataFrame

    Permalink

    Create a DataFrame from an input data source

    Create a DataFrame from an input data source

    path

    A format-specific location String for the source data

    format

    Specifies the input data source format (parquet by default)

    options

    A Map[String, String] of options

    schema

    A StructType schema to apply to the source data

    returns

    a DataFrame createad from the specified source

  3. def write(path: String, format: String = "parquet", options: Map[String, String] = Map[String, String]())(input: DataFrame): DataFrame

    Permalink

    :: Experimental :: Writes a DataFrame to an output data format/location

    :: Experimental :: Writes a DataFrame to an output data format/location

    path

    A format-specific location String for the source data

    format

    Specifies the output data source format (parquet by default)

    options

    A Map[String, String] of options

    returns

    the input DataFrame, unchanged

Inherited from AnyRef

Inherited from Any

Ungrouped