SPARK
Description
Mind Map on SPARK, created by BOGDAN SHEVCHENKO on 18/10/2016.
Resource summary
SPARK
RDD
Transformations
Set
intersection(otherSet)
union(otherSet)
cartesian(otherSet)
Functional
filter(func)
map(func)
distinct()
Actions
saveAsTextFile(path)
array
collect()
take(n)
count()
drop(n)
reduce(function)
Annotations:
The function must be commutative and associative.
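The requirement on reduce can be illustrated without a cluster. The sketch below is plain Python, not Spark: the hypothetical helper `partitioned_reduce` splits a list into chunks, reduces each chunk on its own, then combines the partial results, which is roughly how a distributed reduce proceeds. It only varies the grouping; in Spark the order in which partial results are combined is not guaranteed either, which is why commutativity matters as well.

```python
from functools import reduce
from operator import add, sub


def partitioned_reduce(func, data, num_partitions):
    """Reduce each chunk separately, then reduce the partial results.

    Hypothetical helper that only mimics a distributed reduce; not Spark code.
    """
    size = -(-len(data) // num_partitions)  # ceiling division
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    partials = [reduce(func, chunk) for chunk in chunks]
    return reduce(func, partials)


numbers = [1, 2, 3, 4, 5, 6]

# Addition is commutative and associative: every partitioning agrees.
sums = [partitioned_reduce(add, numbers, p) for p in (1, 2, 3)]

# Subtraction is neither, so the result depends on the partitioning.
diffs = [partitioned_reduce(sub, numbers, p) for p in (1, 2, 3)]

print(sums)
print(diffs)
```

With addition all three partitionings give the same total, while subtraction yields a different value for each, so a non-associative function passed to reduce would give results that change with the number of partitions.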
MapReduce
WorkFlow
SparkContext
pyspark.sql.SparkSession (sparkContext)
Modules
pyspark.sql
Annotations:
http://spark.apache.org/docs/latest/api/python/pyspark.sql.html
functions
udf
pyspark.streaming
pyspark.ml
pyspark.mllib
Annotations:
https://habrahabr.ru/company/mlclass/blog/251471/
linalg
Vectors
dense
sparse
stat
Statistics
colStats
mean
numNonzeros
variance
corr
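As a rough picture of what the statistics in this branch compute, here is a plain-Python sketch. It does not call MLlib; `col_stats` and `pearson` are hypothetical helpers, and the use of the sample variance (n - 1 in the denominator) and of Pearson correlation are assumptions made for the illustration.

```python
from math import sqrt
from statistics import mean, variance


def col_stats(rows):
    """Per-column mean, sample variance and non-zero count (illustrative helper)."""
    columns = list(zip(*rows))
    return {
        "mean": [mean(col) for col in columns],
        "variance": [variance(col) for col in columns],
        "numNonzeros": [sum(1 for v in col if v != 0) for col in columns],
    }


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    norm = sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))
    return cov / norm


# Three observations with two features each (made-up illustration data).
rows = [[1.0, 0.0], [2.0, 4.0], [3.0, 8.0]]
stats = col_stats(rows)
r = pearson([row[0] for row in rows], [row[1] for row in rows])
print(stats)
print(r)
```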
feature
StandardScaler
Annotations:
scaler = StandardScaler(withMean=True, withStd=True).fit(features)
scaler.transform(features.map(lambda x: x.toArray()))
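What the two flags do can be sketched in plain Python. This is not the MLlib implementation: `standardize` is a hypothetical helper, and it assumes the sample standard deviation (n - 1 in the denominator). The `toArray()` call in the annotation above presumably densifies the vectors because centering needs dense input.

```python
from statistics import mean, stdev


def standardize(rows, with_mean=True, with_std=True):
    """Center and scale each column of a list of equal-length rows.

    Hypothetical stand-in for the idea behind StandardScaler, not MLlib code.
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    stds = [stdev(col) for col in columns]
    scaled = []
    for row in rows:
        new_row = []
        for value, m, s in zip(row, means, stds):
            if with_mean:
                value = value - m  # withMean: subtract the column mean
            if with_std and s != 0:
                value = value / s  # withStd: divide by the column std
            new_row.append(value)
        scaled.append(new_row)
    return scaled


# Made-up illustration data: two features on different scales.
features = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
scaled = standardize(features)
print(scaled)
```

After scaling, both columns have zero mean and unit standard deviation, so features measured on different scales become comparable.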
classification
LogisticRegressionWithSGD
RidgeRegressionWithSGD (a regression model; it lives in pyspark.mllib.regression)
NaiveBayes
tree
DecisionTree
RandomForest
clustering
KMeans
recommendation
ALS
Annotations:
Collaborative filtering
Shuffle
Annotations:
https://0x0fff.com/spark-architecture-shuffle/