Hadoop: The Definitive Guide, 4th Edition
Hadoop: The Definitive Guide, 4th Edition
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.
Learn fundamental components such as MapReduce, HDFS, and YARN
Explore MapReduce in depth, including steps for developing applications with it
Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
Learn two data formats: Avro for data serialization and Parquet for nested data
Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
Learn the HBase distributed database and the ZooKeeper distributed configuration service
Java SE 8 for the Really Impatient
Eagerly anticipated by millions of programmers, Java SE 8 is the most important Java update in many years. Now, internationally renowned Java expert Cay S. Horstmann ("Core Java") concisely introduces Java 8's most valuable new features (plus a few Java 7 innovations that haven't gotten the attention they deserve). If you're an experienced Java programmer, Horstmann's practical insights and sample code will help you quickly take advantage of these and other Java language and platform improvements: * Using Lambda expressions (closures) to write computation "snippets" that can be passed to common utility functions, making Java collections far more flexible * Major updates to concurrent programming that make use of lambda expressions (filter/map/reduce) and improve Java's fork-join framework * The long-awaited introduction of a well-designed date/time/calendar library (JSR 310) * Annotations on Java types to support useful pluggable type checkers and refine Java's built-in type system * Java/JavaScript integration via Project Nashorn * Transition to the JavaFX API as the official way to build media-rich, highly interactive client-side Java applications * Simpler tuning of HotSpot JVM garbage collection The first book to cover all these highly-anticipated improvements, Cay Horstmann's Java SE 8 for the Really Impatient will be indispensable to everyone who wants to write tomorrow's most robust, efficient, and secure Java code.
Effective Java 3rf Edition
Java has changed dramatically since the previous edition of Effective Java was published shortly after the release of Java 6. This Jolt award-winning classic has now been thoroughly updated to take full advantage of the latest language and library features. The support in modern Java for multiple paradigms increases the need for specific best-practices advice, and this book delivers.
As in previous editions, each chapter of Effective Java, Third Edition, consists of several “items,” each presented in the form of a short, stand-alone essay that provides specific advice, insight into Java platform subtleties, and updated code examples. The comprehensive descriptions and explanations for each item illuminate what to do, what not to do, and why.
The third edition covers language and library features added in Java 7, 8, and 9, including the functional programming constructs that were added to its object-oriented roots. Many new items have been added, including a chapter devoted to lambdas and streams.
New coverage includes
Functional interfaces, lambda expressions, method references, and streams
Default and static methods in interfaces
Type inference, including the diamond operator for generic types
The @SafeVarargs annotation
The try-with-resources statement
New library features such as the Optional interface, java.time, and the convenience factory methods for collections
Elasticsearch Blueprints
Elasticsearch is a distributed search server similar to Apache Solr with a focus on large datasets, schemaless setup, and high availability. Utilizing the Apache Lucene library (also used in Apache Solr), Elasticsearch enables powerful full-text searches, autocomplete, the "morelikethis" search, multilingual functionality, as well as an extensive search query DSL.
Elasticsearch's schemafree architecture provides developers with built-in flexibility as well as ease of setup. This architecture allows Elasticsearch to index and search unstructured content, making it perfectly suited for both small projects and large big data warehouses—even with petabytes of unstructured data.
This book will enable you to utilize the amazing features of Elasticsearch and build projects to simplify operations on even large datasets. This book starts with the creation of a Google-like web search service, enabling you to generate your own search results. You will then learn how an e-commerce website can be built using Elasticsearch, which will help users search and narrow down the set of products they are interested in. You will explore the most important part of a search—relevancy—based on the various parameters, such as relevance, document collection relevance, user usage pattern, geographic nearness, and document relevance to select the top results.
Next, you will discover how Elasticsearch manages relational content for even complex real-world data. You will then learn the capabilities of Elasticsearch as a strong analytic search platform, which coupled with some visualization techniques can produce real-time data visualization. You will also discover how to improve your search quality and widen the scope of matches using various analyzer techniques. Finally, this book will cover the various geo capabilities of Elasticsearch to make your searches similar to real-world scenarios.
ElasticSearch 5.0 Cookbook
Elasticsearch is one of the most powerful solution, written with the cloud and distributed
computing in mind. Its main author, Shay Banon, famous for having developed Compass
(http://www.compass-project.org), released the first version of Elasticsearch in March 2010.
Thus the main scope of Elasticsearch is to be a search engine; it also provides a lot of features
that allows using it also as data-store and analytic engine via its aggregation framework.
Elasticsearch contains a lot of innovative features: JSON REST based, natively distributed in a
map/reduce approach for both search and analytics, easy to set up and extensible with plugins.
From 2010 when it started to be developed, to last version (5.x) there is a big evolution of the
product becoming one of the most used datastore for a lot of markets. In this book we will go
in depth on these changes and features and many others capabilities available in Elasticsearch.
Elasticsearch is also a product in continuous evolution and new functionalities are released both
by the Elasticsearch Company (the company founded by Shay Banon to provide commercial
support for Elasticsearch) and by Elasticsearch users as plugin (mainly available on GitHub).
Today a lot of the major world players in IT industry (see some use cases at
https://www.elastic.co/use-cases) are using Elasticsearch for its simplicity and advanced
features.
Elasticsearch Tutorial
Elasticsearch is a search engine based on Lucene. It provides a distributed, multitenant-capable full-text search engine with an
HTTP web interface and schema-free JSON documents. Elasticsearch is developed in Java and is released as open source under
the terms of the Apache License. Elasticsearch is the most popular enterprise search engine followed by Apache Solr, also based
on Lucene.
Elasticsearch can be used to search all kinds of documents. It provides scalable search, has near real-time search, and supports
multitenancy. Elasticsearch is distributed, which means that indices can be divided into shards and each shard can have zero
or more replicas. Each node hosts one or more shards, and acts as a coordinator to delegate operations to the correct shard(s).
Rebalancing and routing are done automatically. Related data is often stored in the same index, which consists of one or more
primary shards, and zero or more replica shards. Once an index has been created, the number of primary shards cannot be
changed. (Source: https://en.wikipedia.org/wiki/Elasticsearch)
In this ebook, we provide a series of tutorials so that you can develop your own Elasticsearch based applications. We cover a
wide range of topics, from installation and operations, to Java API Integration and reporting. With our straightforward tutorials,
you will be able to get your own projects up and running in minimum time.
programming-openresty
OpenResty itself has been relying on automated testing to remain high quality over the
years. As OpenResty core developers, we embrace the test driven development (TDD)
process all the time. An excellent result of our TDD practices over the years is a huge set of
test suites for all the OpenResty co
Bash notes for professionals
This Bash Notes for Professionals book is compiled from Stack Overflow
Documentation, the content is written by the beautiful people at Stack Overflow.
Text content is released under Creative Commons BY-SA, see credits at the end
of this book whom contributed to the various chapters. Images may be copyright
of their respective owners unless otherwise specified
MongoDB Basics
Need a quick and easy to understand introduction to MongoDB and NoSQL databases? MongoDB Basics, from The Definitive Guide to MongoDB, 2E, shows you how a document-oriented database system differs from a relational database, and how to install and get started using it. You'll also learn MongoDB design basics, including geospatial indexing, how to navigate, view, and query your database, and how to use GridFS with a bit of Python.
What you’ll learn
What sets MongoDB apart from other databases
How to install MongoDB on all major platforms
How to design a MongoDB database
How to work with GridFS
Who this book is for
Developers and relational database admins who need to get up to speed quickly on what MongoDB is, how it works, and how to use it.
Table of Contents
Ch. 1: Introduction to MongoDB
Ch. 2: Installing MongoDB
Ch. 3: The Data Model
Ch. 4: Working with Data
Ch. 5: GridFS
HTML5 and CSS3 Develop with Tomorrow's Standards Today
HTML5 and CSS3 are the future of web development, but you don't have to wait to start using them. Even though the specification is still in development, many modern browsers and mobile devices already support HTML5 and CSS3. This book gets you up to speed on the new HTML5 elements and CSS3 features you can use right now, and backwards compatible solutions ensure that you don't leave users of older browsers behind.
Beginning HTML5 and CSS3
Beginning HTML5 and CSS3 is your introduction to the new features and elements of HTML5 - all the leaner, cleaner, and more efficient code you've hoped for is available now with HTML5, along with some new tools that will allow you to create more meaningful and richer content. For everyone involved in web design, this book also introduces the new structural integrity and styling flexibility of CSS 3 - which means better-looking pages and smarter content in your website projects.
HTML5 and JavaScript Web Apps
This hands-on book shows you how to work with HTML5, JavaScript MVC frameworks, and the latest W3C specifications to build mobile and desktop web apps that are widely supported across all browsers and devices. You'll quickly master building client-side applications with a loosely coupled backend infrastructure that supports offline clients.
Learn how to incorporate web storage, web workers, geolocation, Device Orientation, and WebSockets into your application architecture, using real-world examples. If you're familiar with server-side programming and understand the basics of HTML5, this book is for you.
Google Maps API, 2nd Edition
This is the second edition of this Friday, which has been rewritten to use the Google Maps V2 API.
The Definitive Guide to MongoDB
MongoDB, a cross-platform NoSQL database, is the fastest-growing new database in the world. MongoDB provides a rich document-oriented structure with dynamic queries that you’ll recognize from RDBMS offerings such as MySQL. In other words, this is a book about a NoSQL database that does not require the SQL crowd to re-learn how the database world works!
MongoDB has reached 1.0 and boasts 50,000+ users. The community is strong and vibrant and MongoDB is improving at a fast rate. With scalable and fast databases becoming critical for today’s applications, this book shows you how to install, administer and program MongoDB without pretending SQL never existed.
What you’ll learn
Set up MongoDB on all major platforms, including Linux and Cloud platforms
Model data within a document-oriented database
Work with your data using non-SQL commands
Write applications using MongoDB's PHP API
Optimize MongoDB
Master MongoDB administration, including replication
MongoDB in Action
Databases are the workhorses of the information age. Like Atlas, they go largely unnoticed in supporting the digital world we’ve come to inhabit. It’s easy to forget that our digital interactions, from commenting and tweeting to searching and sorting, are in essence interactions with a database. Because of this fundamental yet hidden function, I always experience a certain sense of awe when thinking about databases, not unlike the awe one might feel when walking across a suspension bridge normally reserved for automobiles.
The database has taken many forms. The indexes of books and the card catalogs that once stood in libraries are both databases of a sort, as are the ad hoc structured text files of the Perl programmers of yore. Perhaps most recognizable now as databases proper are the sophisticated, fortune-making relational databases that underlie much of the world’s software. These relational databases, with their idealized thirdnormal forms and expressive SQL interfaces, still command the respect of the old guard, and appropriately so.
But as a working web application developer a few years back, I was eager to sample the emerging alternatives to the reigning relational database. When I discovered MongoDB, the resonance was immediate. I liked the idea of using a JSON-like structure to represent data. JSON is simple, intuitive, human-friendly. That MongoDB also based its query language on JSON lent a high degree of comfort and harmony to the usage of this new database. The interface came first. Compelling features like easy replication and sharding made the package all the more intriguing. And by the time I’d built a few applications on MongoDB and beheld the ease of development it imparted, I’d become a convert.
Async [removed] Build More Responsive Apps with Less Code,
Originally devised to enhance web pages in Netscape 2.0, JavaScript is now faced with being a single-threaded language in a multimedia, multitasking, multicore world. Yet JavaScript has not only persevered since 1995, it’s thrived. One after the other, potential rivals in the browser—Flash, Silverlight, and Java applets, to name a few—have come and (more or less) gone. Meanwhile, when a programmer named Ryan Dahl wanted to build a new framework for event-driven servers, he searched the far reaches of computer science for a language that was both dynamic and single-threaded before realizing that the answer was right in front of him. And so, Node.js was born, and JavaScript became a force to be reckoned with in the server world. How did this happen? As recently as 2001, Paul Graham wrote the following in his essay “The Other Road Ahead”:
I would not even use JavaScript, if I were you… Most of the JavaScript I see on the Web isn’t necessary, and much of it breaks.
Today, Graham is the lead partner at Y Combinator, the investment group behind Dropbox, Heroku, and hundreds of other start-ups—nearly all of which use JavaScript. As he put it in a revised version of the essay, “JavaScript now works.”
When did JavaScript become a respectable language? Some say the turning point was Gmail (2004), which showed the world that with a heavy dose of Ajax you could run a first-class email client in the browser. Others say that it was jQuery (2006), which abstracted the rival browser APIs of the time to create a de facto standard. (As of 2011, 48 percent of the top 17,000 websites use jQuery.)
CSS3 The Missing Manual 3rd
CSS3 lets you create professional-looking websites, but learning its finer points can be tricky—even for seasoned web developers. This Missing Manual shows you how to take your HTML and CSS skills to the next level, with valuable tips, tricks, and step-by-step instructions. You’ll quickly learn how to build web pages that look great and run fast on devices and screens of all sizes.
The important stuff you need to know:
Start with the basics. Write CSS3-friendly HTML, including the HTML5 tags recognized by today’s browsers.
Apply real-world design. Format text, create navigation tools, and enhance pages with graphics.
Make your pages lively. Create eye-catching animations and give your visitors attractive tables and forms.
Take control of page layouts. Use professional design techniques such as floats and positioning.
Look great on any device. Craft websites that adapt to desktop, tablet, and mobile browsers.
Get advanced techniques. Use CSS3 more effectively and efficiently, and ensure that your web pages look good when printed.
Spring In Action 第三版
Spring In Action 第三版
HTML5 and CSS3 Seventh Edition Visual QuickStart Guide.Dec.2011
Want to learn how to build Web sites fast? This best-selling guide’s visual format and step-by-step, task-based instructions will have you up and running with HTML5 and CSS3 in no time. This Seventh Edition is a major revision, with approximately 125 pages added and substantial updates to (or complete rewrites of) nearly every page from the preceding edition. Authors Elizabeth Castro and Bruce Hyslop use clear instructions, friendly prose, and real-world code samples to teach you HTML and CSS from the ground up. Over the course of 21 chapters you will learn how to:
Write semantic HTML, both with elements that have been around for years and ones that are new in HTML5.
Prepare images for the Web and add them to your pages.
Use CSS to style text, add background colors and images, and implement a multicolumn layout.