Libros UCLV { BETA }

Gran cantidad de libros a nuestro alcance

Tenemos :
1413 libros,
263021 descargas y
1106 contribuyentes !

Se han encontrado 14 Coincidencias

Hadoop: The Definitive Guide

Big Data


47 Visitas | 83 Descargas | 2015-04-07 21:20:52 | cbustillo

Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you'll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You'll learn about recent changes to Hadoop, and explore new case studies on Hadoop's role in healthcare systems and genomics data processing.

Field Guide to Hadoop

Big Data


36 Visitas | 71 Descargas | 2015-04-07 21:30:07 | cbustillo

QR code - Field Guide to Hadoop Book Description If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. You'll quickly understand how Hadoop's projects, subprojects, and related technologies work together. Each chapter introduces a different topic - such as core technologies or data transfer - and explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, you'll have a good grasp of the playing field.

Hadoop RealWorld Solutions Cookbook

Hadoop


22 Visitas | 44 Descargas | 2015-11-02 15:15:52 | pecarrazana

This book uses concise code examples to highlight different types of real-world problems you can solve with Hadoop. It is designed for developers with varying levels of comfort using Hadoop and related tools. Hadoop beginners can use the recipes to accelerate the learning curve and see real-world examples of Hadoop application. For more experienced Hadoop developers, many of the tools and techniques might expose them to new ways of thinking or help clarify a framework they had heard of but the value of which they had not really understood.

Apache HBase Reference Guide

Apache HBase


16 Visitas | 31 Descargas | 2015-11-10 20:19:12 | pecarrazana

This is the official reference guide for the HBase version it ships with. Herein you will find either the definitive documentation on an HBase topic as of its standing when the referenced HBase version shipped, or it will point to the location in Javadoc or JIRA where the pertinent information can be found.

Professional Hadoop Solutions

Hadoop Solutions


28 Visitas | 35 Descargas | 2015-11-13 20:47:47 | pecarrazana

It’s not that there is a lack of books about Hadoop. Quite a few have been written, and many of them are very good. So, why this one? Well, when the authors started working with Hadoop, we wished there was a book that went beyond APIs and explained how the many parts of the Hadoop ecosystem work together and can be used to build enterprise-grade solutions. We were looking for a book that walks the reader through the data design and how it impacts implementation, as well as explains how MapReduce works, and how to reformulate specific business problems in MapReduce.We were looking for answers to the following questions: ? What are MapReduce’s strengths and weaknesses, and how can you customize it to better suit your needs? ? Why do you need an additional orchestration layer on top of MapReduce, and how does Oozie fit the bill? ? How can you simplify MapReduce development using domain-specific languages (DSLs)? ? What is this real-time Hadoop that everyone is talking about, what can it do, and what canit not do? How does it work?

Hadoop MapReduce v2 Cookbook - Second Edition

Hadoop MapReduce


31 Visitas | 65 Descargas | 2015-11-02 15:13:32 | pecarrazana

Source code can find here: https://nube.uclv.cu/public.php?service=files&t=46c9d61563370329b3a3aedc0176c431 If you are a big data enthusiast and wish to use Hadoop v2 to solve your problems, then this book is for you. This book is for Java programmers with little to moderate knowledge of Hadoop MapReduce. This is also a one-stop reference for developers and system admins who want to quickly get up to speed with using Hadoopv2. It would be helpful to have a basic knowledge of software development using Java and a basic working knowledge of Linux.

Apache Hive Essentials

Apache Hive


36 Visitas | 46 Descargas | 2015-11-13 20:52:40 | pecarrazana

Apache Hive Essentials prepares your journey to big data by covering the introduction of backgrounds and concepts in the big data domain along with the process of setting up and getting familiar with your Hive working environment in the first two chapters. In the next four chapters, the book guides you through discovering and transforming the value behind big data by examples and skills of Hive query languages. In the last four chapters, the book highlights well-selected and advanced topics, such as performance, security, and extensions as exciting adventures for this worthwhile big data journey.

Programming Hive

Apache Hive


35 Visitas | 54 Descargas | 2015-11-13 20:56:41 | pecarrazana

However, a challenge remains; how do you move an existing data infrastructure to Hadoop, when that infrastructure is based on traditional relational databases and the Structured Query Language (SQL)? What about the large base of SQL users, both expert database designers and administrators, as well as casual users who use SQL to extract information from their data warehouses? This is where Hive comes in. Hive provides an SQL dialect, called Hive Query Lan- guage (abbreviated HiveQL or just HQL) for querying data stored in a Hadoop cluster. SQL knowledge is widespread for a reason; it’s an effective, reasonably intuitive model for organizing and using data. Mapping these familiar data operations to the low-level MapReduce Java API can be daunting, even for experienced Java developers. Hive does this dirty work for you, so you can focus on the query itself. Hive translates most queries to MapReduce jobs, thereby exploiting the scalability of Hadoop, while presenting a familiar SQL abstraction. If you don’t believe us, see “Java Versus Hive: The Word Count Algorithm” on page 10 later in this chapter.

Learning Apache Mahout

Apache Mahout


20 Visitas | 44 Descargas | 2015-11-27 16:40:00 | pecarrazana

Learning Apache Mahout is aimed at providing a strong foundation in machine learning using Mahout. This book is ideal for learning the core concepts of machine learning and the basics of Mahout. This book will go from the basics of Mahout and machine learning, to feature engineering and the implementation of various machine learning algorithms in Mahout. Algorithm usage examples will be explained using both the Mahout command line and its Java API. We will conclude the book with two chapters of end-to-end case studies. Ideally, chapters 1, 2 and 3 should be read sequentially, chapters 4 to 8 in any order, and chapters 9 and 10 after chapter 1 to 8 have been completed.

Learning Apache Mahout Classification

Learning Apache Mahout Classification


19 Visitas | 33 Descargas | 2015-11-27 16:43:25 | pecarrazana

If you are a data scientist who has some experience with the Hadoop ecosystem and machine learning methods and want to try out classification on large datasets using Mahout, this book is ideal for you. Knowledge of Java is essential. To use the examples in this book, you should have the following software installed on your system: Java 1.6 or higher Eclipse Hadoop Mahout; we will discuss the installation in Chapter 2, Apache Mahout, of this book Maven, depending on how you install Mahout

Pig Design Patterns

Pig Design Patterns


19 Visitas | 30 Descargas | 2015-11-27 16:46:36 | pecarrazana

The Pig platform surely is one of these methods. Nevertheless, the power of such a platform is best tapped by extending it efficiently. Extending requires great familiarity of the platform. More importantly, extending is fun when the process of building such extensions is easy. The Pig Latin platform offers great simplicity. However, a practitioner's advice is immensely valuable in leveraging this simplicity to an enterprise's own requirement. This is where I find this book to be very apt. It makes you productive with the platform pretty quickly through very well-researched design patterns. This helps simplify programming in Hadoop and create complex end-to-end enterprise-grade Big Data solutions through a building block and best-pattern approach. This book covers the journey of Big Data from the time it enters the enterprise to its eventual use in analytics, either in the form of a dashboard or a predictive model.

Programming Pig

Programming Pig


26 Visitas | 35 Descargas | 2015-11-27 16:48:20 | pecarrazana

This book is intended for Pig programmers, new and old. Those who have never used Pig will find introductory material on how to run Pig and to get them started writing Pig Latin scripts. For seasoned Pig users, this book covers almost every feature of Pig: different modes it can be run in, complete coverage of the Pig Latin language, and how to extend Pig with your own User Defined Functions (UDFs). Even those who have been using Pig for a long time are likely to discover features they have not used before. Being a relatively young project, Pig has changed and grown significantly over the last four years. In that time we have released versions 0.1 through 0.9. This book assumes Pig 0.7 as the base version. Wherever features are only in versions 0.8 or 0.9, this is called out. The biggest change from 0.6 to 0.7 is that load and store function interfaces were rewritten, so Chapter 11 will not be usable by those on 0.6 or earlier versions. However, the rest of the book will still be applicable. Some knowledge of Hadoop will be useful for readers and Pig users. Appendix B pro- vides an introduction to Hadoop and how it works. “Pig on Hadoop” on page 1 walks through a very simple example of a Hadoop job. These sections will be helpful for those not already familiar with Hadoop. Small snippets of Java, Python, and SQL are used in parts of this book. Knowledge of these languages is not required to use Pig, but knowledge of Python and Java will be necessary for some of the more advanced features. Those with a SQL background may find “Comparing query and dataflow languages” on page 4 to be a helpful starting point in understanding the similarities and differences between Pig Latin and SQL.

ZooKeeper

Zookeeper


22 Visitas | 36 Descargas | 2015-11-27 16:51:48 | pecarrazana

Building distributed systems is hard. A lot of the applications people use daily, however, depend on such systems, and it doesn’t look like we will stop relying on distributed computer systems any time soon. Apache ZooKeeper has been designed to mitigate the task of building robust distributed systems. It has been built around core distributed computing concepts, with its main goal to present the developer with an interface that is simple to understand and program against, thus simplifying the task of building such systems. Even with ZooKeeper, the task is not trivial—which leads us to this book. This book will get you up to speed on building distributed systems with Apache ZooKeeper. We start with basic concepts that will quickly make you feel like you’re a distributed systems expert. Perhaps it will be a bit disappointing to see that it is not that simple when we discuss a bunch of caveats that you need to be aware of. But don’t worry; if you under? stand well the key issues we expose, you’ll be on the right track to building great dis? tributed applications.

MapReduce Design Patterns

MapReduce Design Patterns


29 Visitas | 50 Descargas | 2015-12-11 18:28:50 | pecarrazana

MapReduce is a computing paradigm for processing data that resides on hundreds of computers, which has been popularized recently by Google, Hadoop, and many others. The paradigm is extraordinarily powerful, but it does not provide a general solution to what many are calling “big data,” so while it works particularly well on some problems, some are more challenging. This book will teach you what problems are amenable to the MapReduce paradigm, as well as how to use it effectively. At first glance, many people do not realize that MapReduce is more of a framework than a tool. You have to fit your solution into the framework of map and reduce, which in some situations might be challenging. MapReduce is not a feature, but rather a con? straint. This makes problem solving easier and harder. It provides clear boundaries for what you can and cannot do, making the number of options you have to consider fewer than you may be used to. At the same time, figuring out how to solve a problem with con? straints requires cleverness and a change in thinking. Learning MapReduce is a lot like learning recursion for the first time: it is challenging to find the recursive solution to the problem, but when it comes to you, it is clear, concise, and elegant. In many situations you have to be conscious of system resources being used by the MapReduce job, especially inter-cluster network utilization. The tradeoff of being confined to the MapReduce framework is the ability to process your data with dis? tributed computing, without having to deal with concurrency, robustness, scale, and other common challenges. But with a unique system and a unique way of problem solving, come unique design patterns.