Big Data Memo

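As heaven moves with vigor, the noble person strives ceaselessly to improve; as the earth is receptive, the noble person bears all things with great virtue.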

Common Vim Commands

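1 Switching to insert mode. In command mode: a (lowercase) inserts after the cursor; A (uppercase) inserts at the end of the current line; i (lowercase) inserts before the cursor; I (uppercase) inserts at the beginning of the current line; o (lowercase)...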

Starting Hadoop Automatically at Boot

1 Add the startup commands to /etc/rc.d/rc.local. After the addition, the file looks like this: #!/bin/sh # # This script will be executed *after* all the other init scripts. # You can put your own initialization stuff in here if you don't # want to...

Zeppelin Installation Quick Guide

Zeppelin Installation. For the stable binary package, visit the Apache Zeppelin download page. Start Apache Zeppelin with a service manager. To modify the zeppelin.server address and port: <property> ...

Spark Installation Quick Guide

Prerequisites: 3 virtual machines; passwordless SSH login; Java installed; tools downloaded. Install Spark. Step 1: Install Pip. pip is "A tool for installing and managing Python packages."; in other words, pip is Python's package installer. As root: cd /usr/local/src wget...

A Tale of Three Apache Spark APIs

Three Apache Spark APIs source In this blog, I explore three sets of APIs—RDDs, DataFrames, and Datasets—available in a pre-release preview of Apache Spark 2.0; why and when you should use each s...

SPSS for BigData Modeling

SPSS for BigData Modeling 1 Featured IBM Analytics products SPSS|Predictive analytics IBM SPSS predictive analytics software offers advanced techniques in an easy-to-use pack...

THE NEURAL NETWORK ZOO

Lev2 Quant 1

Framework R9 Correlation and regression R10 Multiple regression and issues in regression analysis R11 Time-series analysis R12 Excerpt from “Probabilistic Approaches: Scenario Analysis, D...

Hadoop Cluster Installation Quick Guide

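Prerequisites: 3 virtual machines; passwordless SSH login; Java installed; Hadoop package downloaded; tools downloaded. Configuration. Directory layout (directory: description). hdfs: stores HDFS-related files, typically the namenode, datanode, and logs files. logs ...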

Setup passphraseless ssh

Configure passwordless login: 1. Generate the public key [hadoop@NN01 ~]$ ssh-keygen -t rsa -P '' -f ~/.ssh/id_rsa Generating public/private rsa key pair. Your identification has been saved in /home/hadoop/.ssh/id_rsa. Your public key h...
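
As a rough illustration of the key-generation step in this excerpt, here is a minimal sketch that can be run safely in a scratch directory. The `KEY_DIR` variable is a hypothetical stand-in for `~/.ssh`, and the authorization and permission steps are assumptions about what typically follows key generation, not a reproduction of the original post. On a real cluster the public key would be appended to `authorized_keys` on each target host.

```shell
#!/bin/sh
# Sketch only: uses a scratch directory so no real ~/.ssh files are touched.
set -eu

# Hypothetical stand-in for ~/.ssh (not from the original post).
KEY_DIR="$(mktemp -d)"

# 1. Generate an RSA key pair with an empty passphrase,
#    using the same -t/-P/-f flags as the excerpt (-q just silences output).
ssh-keygen -q -t rsa -P '' -f "$KEY_DIR/id_rsa"

# 2. Assumed next step: authorize the public key by appending it
#    to an authorized_keys file.
cat "$KEY_DIR/id_rsa.pub" >> "$KEY_DIR/authorized_keys"

# 3. Keep the directory and file private; sshd may ignore an
#    authorized_keys file whose permissions are too loose.
chmod 700 "$KEY_DIR"
chmod 600 "$KEY_DIR/authorized_keys"

echo "Key pair and authorized_keys written to $KEY_DIR"
```

To apply the same idea for real, the paths would point at `~/.ssh` instead of the scratch directory, and a login such as `ssh localhost` should then succeed without a password prompt.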