Toggle navigation
Big Data Memo
Home
About
Category
Tags
Talks
Tags
keep hungry keep foolish
Machine Learning
Spark
ME
Other
CFAL1
Python
maths
Baby
Big Data
大数据
IT
机器学习
Scala
CFAL2
Deep Learning
Travel
Kafka
Tools
tmp
Glossary
English
wordcloud
quant
python
work
r
scala
故障预测
Clustering
finance
bigdata
mahout
CNN
ML
Graphx
smtplib
email
bottle
FDA
Machine Learning
Spark-ML-0501-Clustering-KMeans
K-means is one of the most commonly used clustering algorithms that clusters the data points into a predefined number of clusters.
Spark-ML-0502-Clustering-Gaussian mixture
高斯混合模型
Spark-ML-05-Clustering
Clustering is an unsupervised learning problem whereby we aim to group subsets of entities with one another based on some notion of similarity. Clustering is often used for exploratory analysis and/or as a component of a hierarchical supervised learning pipeline (in which distinct classifiers or regression models are trained for each cluster).
Spark-ML-030103-Linear Model-Regression
线性模型:Regression
Spark-ML-030101-Linear Model-LSVM
线性模型:LSVM
Spark-ML-030102-Linear Model-LR
线性模型:LR
Spark-ML-0301-Linear Model
线性模型
Spark-ML-0305-Isotonic regression
Isotonic regression
Spark-ML-0304-Ensembles-RF
RF
Spark-ML-0304-Ensembles-Gradient-Boosted Trees (GBTs)
Gradient-Boosted Trees (GBTs)
Spark-ML-0304-Ensembles
Ensembles 集成学习
Spark-ML-03-Classification-Regression
分类与回归
Fault detection and diagnosis Approaches
故障预测与故障诊断
Spark-ML-0302-Naive Bayes
Spark机器学习算法学习——Classificition and Regression——Naive Bayes
Spark-ML-0303-Decision Tree
Spark机器学习算法学习——Classificition and Regression——Decision Tree
推荐系统中的评测指标
推荐系统中的评测指标
推荐系统中的相似度计算方法总结
推荐系统中的相似度计算方法
Spark-ML-04-collaborative-filtering
Spark机器学习算法学习——CollaborativeFiltering——ALS
Spark-ML-0203-Stratified sampling
Spark机器学习算法学习——BasicStatistics——Stratified sampling
Spark-ML-0201-Summary statistics
Spark机器学习算法学习——BasicStatistics——Summary statistics
Spark-ML-0205-Random data generation
Spark机器学习算法学习——BasicStatistics——Random data generation
Spark-ML-0206-Kernel density estimation
Spark机器学习算法学习——BasicStatistics——Kernel density estimation
Spark-ML-0204-Hypothesis testing
Spark机器学习算法学习——BasicStatistics——Hypothesis testing
Spark-ML-0202-Correlations
Spark机器学习算法学习——BasicStatistics——Correlations
Spark-ML-00
Spark机器学习算法学习00
LDA
LDA的通俗解释from quora
Spark-ML-01
Spark机器学习算法学习——数据类型
Spark
Graphx operation 2[转]
Grapregel 与 spark graphX 的 pregel api
Graphx operation 1
Graph Operators
Graphx prelimery
graphx-programming-guide
Secondary Sort
Secondary Sort 实际上就是一种对Value进行二次排序
Spark-ML-0501-Clustering-KMeans
K-means is one of the most commonly used clustering algorithms that clusters the data points into a predefined number of clusters.
Spark-ML-0502-Clustering-Gaussian mixture
高斯混合模型
Spark-ML-05-Clustering
Clustering is an unsupervised learning problem whereby we aim to group subsets of entities with one another based on some notion of similarity. Clustering is often used for exploratory analysis and/or as a component of a hierarchical supervised learning pipeline (in which distinct classifiers or regression models are trained for each cluster).
How to use SparkSession in Apache Spark 2.0
How to use SparkSession in Apache Spark 2.0. A unified entry point for manipulating data with Spark
Spark-ML-0302-Naive Bayes
Spark机器学习算法学习——Classificition and Regression——Naive Bayes
Spark-ML-0303-Decision Tree
Spark机器学习算法学习——Classificition and Regression——Decision Tree
Spark-ML-04-collaborative-filtering
Spark机器学习算法学习——CollaborativeFiltering——ALS
Spark-ML-0203-Stratified sampling
Spark机器学习算法学习——BasicStatistics——Stratified sampling
Spark-ML-0201-Summary statistics
Spark机器学习算法学习——BasicStatistics——Summary statistics
Spark-ML-0205-Random data generation
Spark机器学习算法学习——BasicStatistics——Random data generation
Spark-ML-0206-Kernel density estimation
Spark机器学习算法学习——BasicStatistics——Kernel density estimation
Spark-ML-0204-Hypothesis testing
Spark机器学习算法学习——BasicStatistics——Hypothesis testing
Spark-ML-0202-Correlations
Spark机器学习算法学习——BasicStatistics——Correlations
Spark-ML-00
Spark机器学习算法学习00
Python 2 Kafka 2 SparkStreaming
Python 2 Kafka 2 SparkStreaming
A Tale of Three Apache Spark APIs
RDDs, DataFrames, and Datasets. When to use them and why
Spark Streaming Lesson 1
Spark Streaming Programming Guide.
Scala Lesson 15
Spark With Scala
Spark-ML-01
Spark机器学习算法学习——数据类型
ME
travel site
Hotel
travel Map
Map
Why I Blog?
Why you learn? Why you live? Why you mark down? Here is the answer.
Welcome!
Welcome! Let's learn and think together.
Other
google trends
google trends
How to start applying for Analytics Data Science Masters in the US Universities?
Planning a masters program in data science in US? But, not completely aware of the application process? Or afraid of the application process?
IP
“知识财产”(Intellectual Property) “文学财产”(literary property)或“潜在财产”(underlying property)
This is a test!
Welcome! Let's learn and think together.
CFAL1
Lev1_AI_basic_concept
The very basic of Alternative Investment. There you go, only Finace surive.
Lev1_Derivative_basic_concept
The very basic of Derivative. There you go, only Finace surive.
Lev1_FixedIncome_basic_concept
The very basic of Fixed Income. There you go, only Finace surive.
Lev1_Equity_basic_concept
The very basic of Equity. There you go, only Finace surive.
Lev1_Portfolio_basic_concept
The very basic of Portfolio. There you go, only Finace surive.
Lev1_Corporate_Finance_basic_concept
The very basic_Corporate_Finance. There you go, only Finace surive.
Lev1_FSA_basic_concept
The very basic_Financial Statement Analysis. There you go, only Finace surive.
Lev1_Economics_basic_concept
The very basic_Economics. There you go, only Finace surive.
Lev1_Quants_basic_concept
The very basic_Quants. There you go, only Finace surive.
Python
ANS1
TensorFlow安装记录
TuShare Data
http://tushare.org/macro.html#id2
如何用Python发送邮件
使用python邮件模块自动发送HTML格式邮件
如何用Python和深度神经网络识别图像?[转录]
flask my first website
Flask应用
flask my first website
Flask应用
flask my first website
30分钟编写一个Flask应用
flask; 使用Stormpath来创建并管理用户账户和数据
Secondary Sort
Secondary Sort 实际上就是一种对Value进行二次排序
Spark-ML-030103-Linear Model-Regression
线性模型:Regression
Spark-ML-030101-Linear Model-LSVM
线性模型:LSVM
Spark-ML-030102-Linear Model-LR
线性模型:LR
Spark-ML-0301-Linear Model
线性模型
Spark-ML-0305-Isotonic regression
Isotonic regression
Spark-ML-0304-Ensembles-RF
RF
Spark-ML-0304-Ensembles-Gradient-Boosted Trees (GBTs)
Gradient-Boosted Trees (GBTs)
Spark-ML-0304-Ensembles
Ensembles 集成学习
Spark-ML-03-Classification-Regression
分类与回归
断网环境下利用pip安装Python离线安装包
离线断网环境下安装Python包,配置环境
远程访问jupyter notebook
默认只能在本地访问,如果想把它安装在服务器上,然后在本地远程访问,则需要进行如下配置
Zipline量化平台
方案备选一
How to use SparkSession in Apache Spark 2.0
How to use SparkSession in Apache Spark 2.0. A unified entry point for manipulating data with Spark
新三板文本项目
新三板POC文本项目说明
新三板聚类项目
新三板聚类Demo
wordcloud
One of the most powerful things we can do with text is find ways to visually represent the information it is expressing. Word clouds are one of the best methods of achieving this.
推荐系统中的评测指标
推荐系统中的评测指标
推荐系统中的相似度计算方法总结
推荐系统中的相似度计算方法
hackerrank_numpy
become an hacker need to parctice.
maths
Matrix operations and inverses
LA by Gilbert Strang
hackerrank_numpy
become an hacker need to parctice.
Baby
专家盘点生二胎的四点好处和三点坏处
专家盘点生二胎的四点好处和三点坏处!快来对号入座
dialogue about start up a business
start up your business.
Big Data
Hadoop 开机自启动
设置Hadoop程序开机自启动的方法
Zeppelin安装 快速指南
Zeppelin 安装
Spark安装 快速指南
Spark 安装
A Tale of Three Apache Spark APIs
RDDs, DataFrames, and Datasets. When to use them and why
SPSS for BigData Modeling
SPSS对接Hadoop分析模式。
Hadoop 集群安装 快速指南
Hadoop 安装
kafka组件深度解析
kafka组件深度解析
Hadoop Tutorial ---- 学习安排
Hive与Spark学习提纲
Hadoop Tutorial ---- Spark入门实战系列--1.Spark及其生态圈简介
Spark入门实战系列--1.Spark及其生态圈简介
培训虚拟机使用指南
使用指南
Hadoop Tutorial ---- Components Required
大数据学习与培训,Let‘s work together!
Hadoop Mesos
mesos和yarn区别
Hadoop Learn Guide
Guide of Hadoop Ecosystem.
Hadoop Resource
Limited site of Hadoop Resource.
HadoopMapReduce
Simplified Data Processing on Large Clusters
Hadoop Mahout
Hadoop Family.Mahout.
大数据名词速览
Hadoop Family | Cloud Computing 名词速览.
Hadoop生态圈
如何用形象的比喻描述大数据的技术生态?Hadoop、Hive、Spark 之间是什么关系?
Hadoop Family Integration
Hadoop Family. Hadoop 生态圈概览。
Hadoop Ecosystem Evolves-10 Cool Big Data Projects
In the 10 years since developers created Hadoop to wrangle the challenges that came with big data, the ecosystem for these technologies has evolved. The Apache Software Foundation is teeming with open source big data technology projects. Here's a look at some significant projects, and a peek at some up-and-comers.
Hadoop Ecosystem Table
This page is a summary to keep the track of Hadoop related project, and relevant projects around Big Data scene focused on the open source, free software enviroment.转载至http://hadoopecosystemtable.github.io, Please Flolow
Hadoop or Spark
Which Is The Best Big Data Framework? - Forbes.
hackerrank_numpy_test
become an hacker need to parctice.
大数据
Hadoop 开机自启动
设置Hadoop程序开机自启动的方法
Zeppelin安装 快速指南
Zeppelin 安装
Spark安装 快速指南
Spark 安装
Hadoop 集群安装 快速指南
Hadoop 安装
kafka组件深度解析
kafka组件深度解析
Hadoop Tutorial ---- 学习安排
Hive与Spark学习提纲
Hadoop Tutorial ---- Spark入门实战系列--1.Spark及其生态圈简介
Spark入门实战系列--1.Spark及其生态圈简介
培训虚拟机使用指南
使用指南
Hadoop Tutorial ---- Components Required
大数据学习与培训,Let‘s work together!
Hadoop Mesos
mesos和yarn区别
Hadoop Learn Guide
Guide of Hadoop Ecosystem.
Hadoop Resource
Limited site of Hadoop Resource.
HadoopMapReduce
Simplified Data Processing on Large Clusters
Hadoop Mahout
Hadoop Family.Mahout.
大数据名词速览
Hadoop Family | Cloud Computing 名词速览.
Hadoop生态圈
如何用形象的比喻描述大数据的技术生态?Hadoop、Hive、Spark 之间是什么关系?
Hadoop Family Integration
Hadoop Family. Hadoop 生态圈概览。
Hadoop Ecosystem Evolves-10 Cool Big Data Projects
In the 10 years since developers created Hadoop to wrangle the challenges that came with big data, the ecosystem for these technologies has evolved. The Apache Software Foundation is teeming with open source big data technology projects. Here's a look at some significant projects, and a peek at some up-and-comers.
Hadoop Ecosystem Table
This page is a summary to keep the track of Hadoop related project, and relevant projects around Big Data scene focused on the open source, free software enviroment.转载至http://hadoopecosystemtable.github.io, Please Flolow
Hadoop or Spark
Which Is The Best Big Data Framework? - Forbes.
hackerrank_numpy_test
become an hacker need to parctice.
IT
Flask应用
flask my first website
Flask应用
flask my first website
30分钟编写一个Flask应用
flask; 使用Stormpath来创建并管理用户账户和数据
断网环境下利用pip安装Python离线安装包
离线断网环境下安装Python包,配置环境
远程访问jupyter notebook
默认只能在本地访问,如果想把它安装在服务器上,然后在本地远程访问,则需要进行如下配置
Vim常用命令
归纳,方便自查
Setup passphraseless ssh
配置免密码登陆
consistent hashing
转载请说明出处-http://blog.csdn.net/cywosp/article/details/23397179
scala with sublime
compile and execute with sublime.
sublime keyboard shortcuts
keyboard shortcuts of sublime.
机器学习
LDA
LDA的通俗解释from quora
Scala
Scala Featured
scala中"_"的用法总结
Scala Lesson 16
Databricks Scala 编程风格指南
Scala Lesson 15
Spark With Scala
Scala Lesson 14
Scala 提取器(Extractor)
Scala Lesson 13
Scala 异常处理
Scala Lesson 12
Scala 正则表达式
Scala Lesson 11
Scala 模式匹配 scala-pattern-matching
Scala Lesson 9
Scala 文件 I/O
Scala Lesson 8
Scala 类和对象 Scala 继承
Scala Lesson 10
Scala Trait(特征)
Scale tuple
Scala 元组
Scala Set
Scala Set(集合)
Scala Option
Scala Option(选项)
Scala Map
Scala Map(映射)
Scala List
Scala List(列表)
scala Iterators
Scala Iterator(迭代器)
scala Lesson 7
Scala Collection
scala Lesson 6
Scala Arrays
scala Lesson 5
Scala Strings
scala Lesson 4
Scala 函数
scala Lesson 3
IF...ELSE 语句,循环判断,
scala Lesson 2
Scala 访问修饰符,运算符
scala Lesson 1
变量函数,操作符,基本类型,Scala包
scala with sublime
compile and execute with sublime.
CFAL2
Lev2 Quant 1
The very basic of Alternative Investment. There you go, only Finace surive.
Deep Learning
THE NEURAL NETWORK ZOO
neural-network-zoo
Travel
travel site
Hotel
Kafka
Python 2 Kafka 2 SparkStreaming
Python 2 Kafka 2 SparkStreaming
Tools
DataMing Tools Collected
DataMing Tools Collected
tmp
tmp
Glossary
Lev2 Glossary
Glossary for review.
English
词根W-Z
Learning 词根
词根V
Learning 词根
词根U
Learning 词根
词根T
Learning 词根
词根S
Learning 词根
词根R
Learning 词根
词根Q
Learning 词根
词根P
Learning 词根
词根O
Learning 词根
词根N
Learning 词根
词根M
Learning 词根
词根L
Learning 词根
词根K
Learning 词根
词根J
Learning 词根
词根I
Learning 词根
词根H
Learning 词根
词根G
Learning 词根
词根F
Learning 词根
词根E
Learning 词根
词根D
Learning 词根
词根C
Learning 词根
词根B
Learning 词根
词根A
Learning 词根
wordcloud
新三板文本项目
新三板POC文本项目说明
新三板聚类项目
新三板聚类Demo
wordcloud
One of the most powerful things we can do with text is find ways to visually represent the information it is expressing. Word clouds are one of the best methods of achieving this.
quant
截至2017中国智能投顾产品总览
截至2017中国智能投顾产品总览; Digital asset allocation; Robo-Advisor;
Zipline量化平台
方案备选一
量化平台ABC
量化平台ABC
python
yh模型开发技术选型
确定合适的开发语言、平台等
量化平台ABC
量化平台ABC
work
yh模型开发技术选型
确定合适的开发语言、平台等
r
yh模型开发技术选型
确定合适的开发语言、平台等
scala
yh模型开发技术选型
确定合适的开发语言、平台等
故障预测
Fault detection and diagnosis Approaches
故障预测与故障诊断
Clustering
Spark-ML-0501-Clustering-KMeans
K-means is one of the most commonly used clustering algorithms that clusters the data points into a predefined number of clusters.
Spark-ML-0502-Clustering-Gaussian mixture
高斯混合模型
Spark-ML-05-Clustering
Clustering is an unsupervised learning problem whereby we aim to group subsets of entities with one another based on some notion of similarity. Clustering is often used for exploratory analysis and/or as a component of a hierarchical supervised learning pipeline (in which distinct classifiers or regression models are trained for each cluster).
finance
截至2017中国智能投顾产品总览
截至2017中国智能投顾产品总览; Digital asset allocation; Robo-Advisor;
bigdata
Mahout Demo
mahout
mahout
Mahout Demo
mahout
CNN
如何用Python和深度神经网络识别图像?[转录]
flask my first website
ML
Market-Regimes with a Hidden-Markov Model
马尔可夫模型和隐马尔可夫模型、市场风格转换识别
数据挖掘,你我常常忽略的小问题
数据挖掘有哪些值得注意的小细节
数据挖掘,你我常常忽略的小问题
数据挖掘有哪些值得注意的小细节
如何用Python和深度神经网络识别图像?[转录]
flask my first website
Graphx
Graphx operation 2[转]
Grapregel 与 spark graphX 的 pregel api
Graphx operation 1
Graph Operators
Graphx prelimery
graphx-programming-guide
smtplib
如何用Python发送邮件
使用python邮件模块自动发送HTML格式邮件
email
如何用Python发送邮件
使用python邮件模块自动发送HTML格式邮件
bottle
如何用Python发送邮件
使用python邮件模块自动发送HTML格式邮件
FDA
ANS1
TensorFlow安装记录