本站提供 8500 多本免费的 IT 技术电子书在线下载。
  1. 文章总数:8391
  2. 浏览总数:327,430
  3. 评论:0
  4. 分类目录:125 个
  5. 注册用户数:29
  6. 最后更新:2019年11月22日
过往记忆博客公共帐号iteblog_hadoop
欢迎关注微信公共帐号:
iteblog_hadoop

Mastering Spark with R

数据分析 iteblog 104℃ 0评论

关注 过往记忆大数据 微信公众号,回复 8530 获取本书下载地址。

子标题:The Complete Guide to Large-Scale Analysis and Modeling

Mastering Spark with R
作者:
Javier Luraschi, Kevin Kuo , Edgar Ruiz
ISBN-10:
149204637X
出版年份:
2019
页数:
296
语言:
English
文件大小:
5.4 MB
文件格式:
EPUB

图书描述

If you’re like most R users, you have deep knowledge and love for statistics. But as your organization continues to collect huge amounts of data, adding tools such as Apache Spark makes a lot of sense. With this practical book, data scientists and professionals working with large-scale data applications will learn how to use Spark from R to tackle big data and big compute problems.

Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to use R with Spark to solve different data analysis problems. This book covers relevant data science topics, cluster computing, and issues that should interest even the most advanced users.

  • Analyze, explore, transform, and visualize data in Apache Spark with R
  • Create statistical models to extract information and predict outcomes; automate the process in production-ready workflows
  • Perform analysis and modeling across many machines using distributed computing techniques
  • Use large-scale data from multiple sources and different formats with ease from within Spark
  • Learn about alternative modeling frameworks for graph processing, geospatial analysis, and genomics at scale
  • Dive into advanced topics including custom transformations, real-time data processing, and creating custom Spark extensions

下载地址

关注 过往记忆大数据 微信公众号,回复 8530 获取本书下载地址。

如图书无法下载,请加微信 fangzhen0219 反馈。
喜欢 (2)or分享 (0)
发表我的评论
取消评论

表情
本博客评论系统带有自动识别垃圾评论功能,请写一些有意义的评论,谢谢!