Package: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 44324 Depends: adduser, sun-java6-bin, sun-java6-jre Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 22276578 SHA256: b69144f3214877724501ac4360ae23759826601a58011f437d54ae6615bf7d13 SHA1: 3b58625904f5a4cb41145b42c9bbe4775307042d MD5sum: 693aa54d836b60e6a2a52928fa4e7f59 Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 272 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 163240 SHA256: 9182f8f51ad6e659bb9bfe1a00b53ab15d9a0a7d0ce675a6002ce9722ca90854 SHA1: cb744cc49c7d00335a469b937cb6951d2514e4d7 MD5sum: 6ca6432251a75e76e48a055a193ae1ea Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 155808 SHA256: 38286bcdbb6bb986871a2b943d3fba90f1904ba3b16b39531fe226f66305f13e SHA1: b82738cc931bac19ee271466f358f525c9045573 MD5sum: 6c1749212355ca3c35848ea2e5de0c96 Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 48444 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 5815162 SHA256: 4cf51a6a471ee85998f843a9a2bd9998ab6fa375767c01e4f776390da35c28ea SHA1: e17f66a0e23646a02457aa0f15a3b04605988ea5 MD5sum: 831e85b47bfe2252588bdd4b30961b3f Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 155834 SHA256: 2f0dd03e110c65954e0e925a647aa0760e3e985298d9eb722b6fadaf5431e0fb SHA1: ca9d78f039b4dd6c92e1feebe5e3291bd72637f7 MD5sum: 30b401d8147fe6802228f9b886095181 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 155802 SHA256: 73e3c7ea382fc27c205c7ec5b38a6855b695118fc25347dcee75429d9a6dfc40 SHA1: f9a0146c5cf4b7558c3cd49f42e6d16fa5db363d MD5sum: 4018e6a056f96b0fa4b2569ae811a4d4 Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 260 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1), libc6 (>= 2.4), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~hardy-cdh3b1_amd64.deb Size: 164522 SHA256: 332a34422b206ea89d409fe7629d1548ea6eca44a23ee3b9eac2555f48ffe9c7 SHA1: 939b0ec50befee58d10e658efde49214b296bd26 MD5sum: 6d8f1d8b37fb8a5a4c054825ad18cda2 Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 512 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~hardy-cdh3b1_amd64.deb Size: 222052 SHA256: 33cb99fd9f090393ad003d544c76df4b88d2cae3cb7fa98847a2fbcf73cb2381 SHA1: 85b79dbbaa79f5f3858b8b0109cd35d559d1fd86 MD5sum: 3fa0377b8713a422a0c5e06d813f2ad3 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 155832 SHA256: a3d3b0cd739609d1a685fae4735113771ded61ea08bd928713cb52b738624f22 SHA1: ea03ddaadb8df3fff7cc351569bb196f08895d67 MD5sum: b32418814804e5832c865d123221ed85 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 52968 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 16894716 SHA256: 5bc3b7b285e60b52f1611589349cb398e20d85bbaed316bd507cc9c878f3473a SHA1: cee8f045520072946889ecd00e3cfe540bf87eed MD5sum: b14cecf50fcd946726daa3ae6d1ad172 Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~hardy-cdh3b1_all.deb Size: 155810 SHA256: d0a59443918022a015fca226b51d3698a9e4b99c2b450ac2ba2fbb6361ec8407 SHA1: 75c3a498e782613df78c267737967593b8ca2e17 MD5sum: b8b7fedbc37331f45500d1a7e10e7803 Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hive Version: 0.5.0+20-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 13132 Depends: hadoop, sun-java6-jre | sun-java6-sdk Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~hardy-cdh3b1_all.deb Size: 11109988 SHA256: a7cdc59dc8b0ed9eb8133eda1d0c1d64a311e62ab9147cf56de5b8a72ef1d21f SHA1: cc68b1b803355dc8a2af41bad131d5c6624c7406 MD5sum: b9e7084f051962997ef7937abcb0a31e Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.5.0+30-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 79756 Depends: hadoop, sun-java6-jre | sun-java6-sdk Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~hardy-cdh3b1_all.deb Size: 30154696 SHA256: 7780979d615e61809344497bbde3c1220f38dde5009d924770649f0c2c602c13 SHA1: b403590cf9d3757e7bc26cf021188217c32ea400 MD5sum: c95576c766359c86be8fa812a7c94a8e Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 240 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1), libc6 (>= 2.4) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~hardy-cdh3b1_amd64.deb Size: 171316 SHA256: f1ae9a1f8bd448d1d7251b686d1b2949f3f5fb6ca158ee25008173f0002cfa32 SHA1: f86fc0776fb36683ab9e6148e91b6505cdc0c715 MD5sum: 921b98253f5c0d4d5fe2657fdcd39319 Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.2+228-1~hardy-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 228 Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1), libhdfs0 (= 0.20.2+228-1~hardy-cdh3b1) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~hardy-cdh3b1_amd64.deb Size: 165474 SHA256: 73004ba69b81311c39c55b1f462d10e6f9f3018ce7cccd50195c43efea1fb052 SHA1: 7452b27ae6d579ce7098f7415c86e4308fc5344b MD5sum: ea3477471e10065c59ce1f5f564ee48d Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: python-hive Source: hadoop-hive Version: 0.5.0+20-1~hardy-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 700 Depends: python (>= 2.4), python-support (>= 0.7.1) Provides: python2.4-hive, python2.5-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~hardy-cdh3b1_all.deb Size: 53750 SHA256: 57cc627815c0b75be638427a8af262b211b672a61d918613068d2f8cfde9545e SHA1: b000f8819a48f8227ed5d62f1018e1ceac7564c8 MD5sum: 69321bb2aeec68183e9df047562ae573 Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore.