pyspark.sql.functions.var_pop#

pyspark.sql.functions.var_pop(col)[source]#

Aggregate function: returns the population variance of the values in a group.

New in version 1.6.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
colColumn or column name

target column to compute on.

Returns
Column

variance of given column.

See also

pyspark.sql.functions.variance()
pyspark.sql.functions.var_samp()
pyspark.sql.functions.std_pop()

Examples

>>> from pyspark.sql import functions as sf
>>> df = spark.range(6)
>>> df.select(sf.var_pop(df.id)).show()
+------------------+
|       var_pop(id)|
+------------------+
|2.9166666666666...|
+------------------+