Book tags: distributed systems, big data, system design, computer science, architecture, data, O'Reilly
Published 2024-06-20
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords?
In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
Peer under the hood of the systems you already use, and learn how to use and operate them more effectively
Make informed decisions by identifying the strengths and weaknesses of different tools
Navigate the trade-offs around consistency, scalability, fault tolerance, and complexity
Understand the distributed systems research upon which modern databases are built
Peek behind the scenes of major online services, and learn from their architectures
Martin is a researcher in distributed systems at the University of Cambridge. Previously he was a software engineer and entrepreneur at Internet companies including LinkedIn and Rapportive, where he worked on large-scale data infrastructure. In the process he learned a few things the hard way, and he hopes this book will save you from repeating the same mistakes.
Martin is a regular conference speaker, blogger, and open source contributor. He believes that profound technical ideas should be accessible to everyone, and that deeper understanding will help us develop better software.
Wow, this book is seriously amazing. Go read it! Go read it! Go read it!
Well suited to preparing for system design interviews. Twitter's pull and push models, database sharding, and replication are all explained fairly clearly.
I only read the free first four chapters, and it all felt like common knowledge.
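A comprehensive, in-depth introduction to big data systems that covers not only the how but also the why, and builds a panoramic view of the technology.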
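Highly recommended. The writing flows well and makes deep material accessible, and the engineering practices of distributed systems are laid out very clearly.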
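Copyright belongs to the author; please contact the author before reproducing this in any form. Author: 荒城梦 (from Douban). Source: https://www.douban.com/note/725242700/ After several months of on-and-off reading, I have finally finished this "magnum opus." The book is not especially thick for a computer science title; I call it a magnum opus because the further I read, the more I felt that the breadth and depth of its content had already...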
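1. Data models. By data model, databases can be divided into relational, document, graph, and other types. These three types differ in form and in the scenarios where they are used. 1.1. Relational. The relational model is based on the theory of relations from set theory and treats data as a set of tuples. It was proposed in 1970; at first people doubted whether this kind of technology could be implemented efficiently, but...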
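At the start, the book says that "many new applications today are data-intensive rather than compute-intensive." With machine learning becoming ever more widespread, compute-intensive applications in the infrastructure layer behind user-facing applications are in fact increasingly common. "Sadly, what makes me go bald day to day is all compute-intensive; when will there be a De...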
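Covering data generation, transmission, storage, retrieval, updating, and more, this book introduces the storage technologies a backend system relies on, including databases, caches, search engines, batch processing, and stream processing. For each technology, the author explains its basic principles, usage scenarios, strengths, and weaknesses, and supplies an extensive list of further reading. For backend developers and operations staff,...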
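Every data engineer and product manager should read the last section, Doing the right thing, ten times. To convince you that it is worth reading ten times, I will quote a few sentences: automated systems can systematically and arbitrarily exclude a person from participating in society without any proof of guilt, and with l...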