About
Articles by Dean
Activity
5K followers
Experience & Education
Licenses & Certifications
-
-
-
Principles of Reactive Programming
Coursera
Issued -
-
-
-
-
-
Certified Cloudera hadoop Professional
Cloudera, Inc.
Issued -
Publications
-
Hardware > Software > Process
NVIDIA/Manning
An exploration of the changing landscape of hardware, software, and development processes and how they impact the development and performance of data-intensive applications.
Other authorsSee publication -
Programming Scala, Third Edition
O'Reilly Media
See publicationThe third edition of this classic, thorough introduction to Scala now covers Scala 3, which brings dramatic improvements to Scala, one of the most powerful programming languages available today.
-
Fast Data Architectures for Streaming Applications, Second Edition
O'Reilly Media
See publicationFast Data Architectures for Streaming Applications, Second Edition is a free report co-published by Lightbend and O'Reilly on the architectural characteristics of highly available, resilient, scalable, and responsive systems for data stream processing at scale. Originally published in October, 2016, the second edition was published in October, 2018.
The book provides an overview of the common requirements for reliable streaming systems, based on common use cases for streaming, such as…Fast Data Architectures for Streaming Applications, Second Edition is a free report co-published by Lightbend and O'Reilly on the architectural characteristics of highly available, resilient, scalable, and responsive systems for data stream processing at scale. Originally published in October, 2016, the second edition was published in October, 2018.
The book provides an overview of the common requirements for reliable streaming systems, based on common use cases for streaming, such as serving machine learning in a streaming context. Other requirements include the need to handle potential data loss, duplication of data, late arrival, etc. Standard system concerns, the so called reactive principles are important for such systems, where services and applications must run reliably for weeks, months, even years. If you run anything long enough, it will see every rare anomaly: hardware failures, network partitions, traffic spikes, etc. Hence, streaming systems have harder operational requirements than shorter-lived batch processes
Apache Kafka is the messaging backbone of these architectures, providing high scalability and reliability for ingesting data organized into topics and orders, similar to conventional message queues.
The data can then be processed by one or more stream processors. I discuss four, Apache Spark, Apache Flink, Akka Streams, and Kafka Streams, along with the characteristics you should consider when picking one or more engines.
Rounding out the picture are tools for building other microservices, such as the Lightbend Platform, including management and monitoring tools. -
Programming Scala, Second Edition
O'Reilly Media
See publicationGet up to speed on Scala, the JVM language that offers all the benefits of a modern object model, functional programming, and an advanced type system. Packed with code examples, this comprehensive book shows you how to be productive with the language and ecosystem right away, and explains why Scala is ideal for today's highly scalable, data-centric applications that support concurrency and distribution.
This second edition covers recent language features, with new chapters on pattern…Get up to speed on Scala, the JVM language that offers all the benefits of a modern object model, functional programming, and an advanced type system. Packed with code examples, this comprehensive book shows you how to be productive with the language and ecosystem right away, and explains why Scala is ideal for today's highly scalable, data-centric applications that support concurrency and distribution.
This second edition covers recent language features, with new chapters on pattern matching, comprehensions, and advanced functional programming. You’ll also learn about Scala’s command-line tools, third-party tools, libraries, and language-aware plugins for editors and IDEs. This book is ideal for beginning and advanced Scala developers alike. -
Programming Hive
O'Reilly Media
A comprehensive introduction to Hive, the SQL tool for the Hadoop ecosystem.
Other authors -
-
Functional Programming for Java Developers
O'Reilly Media
See publicationA gentle introduction to the core principles of Functional Programming aimed at Java Developers, with an emphasis on why FP is so important.
-
Programming Scala
O'Reilly Media
Programming Scala introduces an exciting new language that offers all the benefits of a modern object model, functional programming, and an advanced type system. Packed with code examples, this comprehensive book teaches you how to be productive with Scala quickly, and explains what makes this language ideal for today's highly scalable, component-based applications that support concurrency and distribution. You'll also learn the advantages that Scala offers as a language for the Java Virtual…
Programming Scala introduces an exciting new language that offers all the benefits of a modern object model, functional programming, and an advanced type system. Packed with code examples, this comprehensive book teaches you how to be productive with Scala quickly, and explains what makes this language ideal for today's highly scalable, component-based applications that support concurrency and distribution. You'll also learn the advantages that Scala offers as a language for the Java Virtual Machine.
Other authorsSee publication
Projects
-
Strata 2012 - Hadoop Data Warehousing with Hive
-
See projectHands-on tutorial on Hive for Hadoop-based data warehousing.
Languages
-
French
Limited working proficiency
Organizations
-
ACM
-
Recommendations received
8 people have recommended Dean
Join now to viewOther similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content