大数据自学技术清单:
1.Linuxshell
2.分布式架构与存储redis,elasticsearch等
3.大数据基础组件Hadoop,zookeeper,flume,kafka,elasticsearch等
4.机器学习
5.流式处理技术
Spark(scala,spark,sparkcore,sparksql,sparkstreaming,sparkmllib,sparkgraphx)
6.基础编程Python,Java
7.虚拟化docker,kvm,openstack