Package: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 44332 Depends: adduser, sun-java6-jre, sun-java6-bin Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 22275146 SHA256: c34a9590367efc28836b13db5c0c2ec4cf8d469a403dd89a6d4d9de7ec27c77a SHA1: 368d236e2d3a7b8e8950d1b7b9680739bcca188b MD5sum: 7a0c1cc551c809d4c517d101e677fbbe Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 284 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 163244 SHA256: e97af9553deb5c6ed21f3f0a1211b9dcdc855117b75db0f470dd37d098881935 SHA1: bb546a66b34c6300ebd52d48ff605ece123cae91 MD5sum: abc3492a90ab3aa68d9f2bad101a1313 Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 155788 SHA256: ebe629999b07c333a5547b4e21d8a0ed9297dd8c29ddad9f9c858739b0ac07db SHA1: 41b684c2968baee3e9a96ca6095ac5df661dda67 MD5sum: c1ca7ecfc6b1ac3f35c1df8070b10c04 Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 48456 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 5830352 SHA256: 1a184c558bd1a9082a72e67f0d65dd1d8b587e9dc7eaeeb5b12c9220b884c5e6 SHA1: 73c59dc47a2ba388de5ad4c7bacc6e213ce04164 MD5sum: 5dbbb9c97ad42d6db6d20a14ba4a59b8 Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 155818 SHA256: 977c43c91484c5396bc7c8602a16459dfeb231429f83fb51dfc1484edd8e645d SHA1: 488040ee3bdd2ee088bee8c48995be7fd5ec5901 MD5sum: fa7db281dbadc94e5ff5fe8688eb43ea Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 155786 SHA256: 82c04f532c928fe67983f0cbd43f8388dac4a328a463645a08b4f472d22be1eb SHA1: 3a01cd2a8ddde544dc7ab235103d83b35eac7543 MD5sum: 8c973b47a5a549b029831b6c6964b63c Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 272 Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~intrepid-cdh3b1_amd64.deb Size: 164658 SHA256: 201db5593ecfc920016a766411568c949f1b2dac673867a423fb9b959d817948 SHA1: 0fc1ceb75b04b4819ea40c65f3eb25aed2f88547 MD5sum: c707fcd3c2e5b50df7e8ddab84f42074 Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 476 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~intrepid-cdh3b1_amd64.deb Size: 216930 SHA256: d885e8f6773aea00165dd8416800609b1d007718537b452190938e44391baff5 SHA1: e66fb9d3cd81a29a0726315f8141e515665625d8 MD5sum: 03cfbc4fd7a915dc1a9b2e36f5c2fae4 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 155816 SHA256: 66b80aac2b999e93646ab75cbd011b0bd37df7e5acc225d7c5f201adadcd8dde SHA1: d3a697004c3d87c1846482dc9293249547c24b76 MD5sum: 57e2548f5ab4d10fa237525e76176803 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 52968 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 16900638 SHA256: 3f15b24af1de51e60fe1acdcade7493b29b3802b4b1466267045781aace42811 SHA1: f3872bdd191001c346694c4f8ba85b14a3e5e095 MD5sum: c802ecbf28e804e17fb1fe7f4f029f8e Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~intrepid-cdh3b1_all.deb Size: 155800 SHA256: ab832e488f69a6afb54ce03ac03ac827677577fc5a86b9c257b03fe95477e567 SHA1: 37682cfcf66430de28e61b40a3effc70c571c67a MD5sum: 2d84d385ec86441c505933f6cd3e0300 Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hive Version: 0.5.0+20-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 13132 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~intrepid-cdh3b1_all.deb Size: 11108902 SHA256: c30ed6cb0a924e6cf0137dd71e3f0c2f2b435b4d99dc29c7f3a333650f331502 SHA1: 2fb86c93cde2bfaec04f28c79e505060efb29269 MD5sum: 9024bec10663c9cfc49c8e57a95f7924 Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.5.0+30-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 79756 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~intrepid-cdh3b1_all.deb Size: 30155750 SHA256: 8a309a4898553e4350b4bed4df86ce45a866b1fcf192f11900064794363cc4ad SHA1: b8db90c7c6b704b3350736ca8616c4fac26951ed MD5sum: 5aeef31ed69b5de9bd86e3dfc17336ac Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 240 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1), libc6 (>= 2.4) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~intrepid-cdh3b1_amd64.deb Size: 171456 SHA256: bc9a41295f7d945b1f8064a716b4acae3d8f4b9034bdbdadae7339d777bece22 SHA1: da89b6fa839667d7028dda42019206ecf3f8a37d MD5sum: 395bf45cf2fef5de25b8ed254b722d7f Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.2+228-1~intrepid-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 228 Depends: hadoop-0.20 (= 0.20.2+228-1~intrepid-cdh3b1), libhdfs0 (= 0.20.2+228-1~intrepid-cdh3b1) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~intrepid-cdh3b1_amd64.deb Size: 165520 SHA256: 5d197d34ac54886c65c8accaf7551173f4bad08da8f5d614bedaab05fadd3a39 SHA1: 960a60c34ee6a64b344c77c30960392db204fe7c MD5sum: 511f98c0349de8e7e304a5f086d66a4e Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: python-hive Source: hadoop-hive Version: 0.5.0+20-1~intrepid-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 700 Depends: python (>= 2.4), python-support (>= 0.7.1) Provides: python2.4-hive, python2.5-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~intrepid-cdh3b1_all.deb Size: 55592 SHA256: 0f1f7b4a406ea3afc2a1faf28d03b4bb3f62a9b7af39e641bf627bf070894481 SHA1: 7aca689b10dfe88e0cc31612ccec23a8f5f656ca MD5sum: 2aeed50c1db50ddd2c28ae6337dae77f Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore.