pyspark
Projects with this topic
Tumult Core is a collection of composable components for implementing algorithms to perform differentially private computations.
Tumult Analytics is a Python library for privately computing aggregate queries on tabular data. It is built atop the Tumult Core library.
A knowledge base covering Databricks and PySpark topics.
Deploying PySpark Jobs on Azure HDInsight Spark Cluster (CI/CD)
Funnel provides an easy-to-use, easy-to-read framework for creating very complex data selections over pandas DataFrames.
"Cloud container data analytics, statistical modeling, and machine learning on distributed databases". "A free open-source alternative to SPSS, SAS, MATLAB, Power BI, Tableau and Alteryx". Runs on Linux, Windows, macOS, and in the cloud via containers.
Yelp Open Dataset explorer using Spark and Cassandra.