Skip to content

Import Spark  #23

Open
Open
@imsanjoykb

Description

@imsanjoykb

import pyspark.sql.functions as F

df = df.withColumn("salt", F.round(100 * F.rand()))
.groupby(['dim1', 'dim2', 'salt']) \
.agg({'dim_to_sum': 'sum'})
.drop('salt')

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions