-
Notifications
You must be signed in to change notification settings - Fork 326
Spark 3.0 readiness part 1 #647
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spark 3.0 readiness part 1 #647
Conversation
merging latest from master
Can you resolve conflicts? Thanks! |
After merging from master, I can see VectorUdf tests failing in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Few nits, but LGTM, thanks @Niharikadutta!
public void PrintSchema(int level) => | ||
Console.WriteLine( | ||
(string)((JvmObjectReference)_jvmObject.Invoke("schema")) | ||
.Invoke("treeString", level)); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
indentation looks off?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changed, please let me know if that looks better!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can try the following if the indentation looks funny.
public void PrintSchema(int level)
{
var schema = (JvmObjectReference)_jvmObject.Invoke("schema");
Console.WriteLine((string)schema.Invoke("treeString", level));
}
This PR adds support for following APIs added in Spark 3.0:
SparkSession APIs
python
def getActiveSession(cls)scala
def executeCommand(runner: String, command: String, options: Map[String, String]): DataFrameDataFrame APIs
scala
/python
def tail(n: Int): Array[T]scala
printSchema(level: Int): Unitscala
def explain(mode: String): Unitscala
def observe(name: String, expr: Column, exprs: Column*): Dataset[T]DataFrameStatFunctions APIs
scala
def sampleBy[T](col: Column, fractions: Map[T, Double], seed: Long): DataFrameThis PR addresses part of #633