圖書標籤: Hadoop 大數據 BigData 計算機 分布式 hadoop 機器學習 O'Reilly
发表于2024-06-21
Hadoop: The Definitive Guide pdf epub mobi txt 電子書 下載 2024
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.
Learn fundamental components such as MapReduce, HDFS, and YARN
Explore MapReduce in depth, including steps for developing applications with it
Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
Learn two data formats: Avro for data serialization and Parquet for nested data
Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
Learn the HBase distributed database and the ZooKeeper distributed configuration service
Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was as an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net and IBM's developerWorks, and has spoken at several conferences, including at ApacheCon 2008 on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.
第四版全麵基於hadoop2,相比前版進行一些重要增添和順序調整,之前版本就不要看瞭。繼續那麼全麵而透徹實用。
評分閱讀瞭第1,2部分,算是對Hadoop有瞭基本的認知,接下來需要結閤實際項目夯實。其他相關的技術如Hive,HBase,Spark也需要去學習。
評分很全,主要是前兩部分,尤其mapreduce部分,後麵的那些cluster和各種相關項目的其實可以隻做瀏覽,講得也不是很細,用的時候看apache的說明文檔就好
評分T^T 買瞭很厚的影印版
評分前半段原理英文第四版,後半段相關項目和案例學習中文第三版就直接劃水劃過去瞭。Definitive Guide一貫作風,料多廢話也多,Hadoop也是復雜又難用,Spark要是革瞭你的命也是理所應當。
买了第一版,时间太紧,没来得及看,后来出了个号称修订升级的第二版,毫不犹豫又买了,后来听说第二版比第一版翻译得好,心中窃喜,再后来看了第二版,我震惊了,我TM就是一傻子,放着好好的英文版不看,赶什么时髦买中文版呢。在这个神奇的国度,牛奶里放的是三聚氰胺,火腿...
評分 評分 評分你的履历添了一笔<hadoop权威指南>译者,但是你不配 这是我见过的最不用心的翻译, 字里行间行文不通顺, 请别勉强自己,map reduce shuffle机制都没翻译的好 虽然原作者写作功底也实在是一般 第 1 2 5 6 7 这几章 翻译的实在是太烂了 请不要呐Google翻译糊弄人阿 误人子弟 ...
評分你的履历添了一笔<hadoop权威指南>译者,但是你不配 这是我见过的最不用心的翻译, 字里行间行文不通顺, 请别勉强自己,map reduce shuffle机制都没翻译的好 虽然原作者写作功底也实在是一般 第 1 2 5 6 7 这几章 翻译的实在是太烂了 请不要呐Google翻译糊弄人阿 误人子弟 ...
Hadoop: The Definitive Guide pdf epub mobi txt 電子書 下載 2024