Package: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 44324
Depends: adduser, sun-java6-bin, sun-java6-jre
Recommends: hadoop-0.20-native
Provides: hadoop
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 22276578
SHA256: b69144f3214877724501ac4360ae23759826601a58011f437d54ae6615bf7d13
SHA1: 3b58625904f5a4cb41145b42c9bbe4775307042d
MD5sum: 693aa54d836b60e6a2a52928fa4e7f59
Description: A software platform for processing vast amounts of data
 Hadoop is a software platform that lets one easily write and run
 applications that process vast amounts of data.
 .
 Here's what makes Hadoop especially useful:
 * Scalable: Hadoop can reliably store and process petabytes.
 * Economical: It distributes the data and processing across clusters
   of commonly available computers. These clusters can number into the
   thousands of nodes.
 * Efficient: By distributing the data, Hadoop can process it in
   parallel on the nodes where the data is located. This makes it
   extremely rapid.
 * Reliable: Hadoop automatically maintains multiple copies of data and
   automatically redeploys computing tasks based on failures.
 .
 Hadoop implements MapReduce, using the Hadoop Distributed File System
 (HDFS). MapReduce divides applications into many small blocks of work.
 HDFS creates multiple replicas of data blocks for reliability, placing
 them on compute nodes around the cluster. MapReduce can then process
 the data where it is located.

Package: hadoop-0.20-conf-pseudo
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 272
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-conf-pseudo
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 163240
SHA256: 9182f8f51ad6e659bb9bfe1a00b53ab15d9a0a7d0ce675a6002ce9722ca90854
SHA1: cb744cc49c7d00335a469b937cb6951d2514e4d7
MD5sum: 6ca6432251a75e76e48a055a193ae1ea
Description: Pseudo-distributed Hadoop configuration
 Contains configuration files for a "pseudo-distributed" Hadoop
 deployment. In this mode, each of the hadoop components runs as a
 separate Java process, but all on the same machine.

Package: hadoop-0.20-datanode
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-datanode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 155808
SHA256: 38286bcdbb6bb986871a2b943d3fba90f1904ba3b16b39531fe226f66305f13e
SHA1: b82738cc931bac19ee271466f358f525c9045573
MD5sum: 6c1749212355ca3c35848ea2e5de0c96
Description: Data Node for Hadoop
 The Data Nodes in the Hadoop Cluster are responsible for serving up
 blocks of data over the network to Hadoop Distributed Filesystem
 (HDFS) clients.

Package: hadoop-0.20-doc
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 48444
Provides: hadoop-doc
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: doc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 5815162
SHA256: 4cf51a6a471ee85998f843a9a2bd9998ab6fa375767c01e4f776390da35c28ea
SHA1: e17f66a0e23646a02457aa0f15a3b04605988ea5
MD5sum: 831e85b47bfe2252588bdd4b30961b3f
Description: Documentation for Hadoop
 This package contains the Java Documentation for Hadoop and its
 relevant APIs.

Package: hadoop-0.20-jobtracker
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-jobtracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 155834
SHA256: 2f0dd03e110c65954e0e925a647aa0760e3e985298d9eb722b6fadaf5431e0fb
SHA1: ca9d78f039b4dd6c92e1feebe5e3291bd72637f7
MD5sum: 30b401d8147fe6802228f9b886095181
Description: Job Tracker for Hadoop
 The jobtracker is a central service which is responsible for managing
 the tasktracker services running on all nodes in a Hadoop Cluster.
 The jobtracker allocates work to the tasktracker nearest to the data
 with an available work slot.

Package: hadoop-0.20-namenode
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-namenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 155802
SHA256: 73e3c7ea382fc27c205c7ec5b38a6855b695118fc25347dcee75429d9a6dfc40
SHA1: f9a0146c5cf4b7558c3cd49f42e6d16fa5db363d
MD5sum: 4018e6a056f96b0fa4b2569ae811a4d4
Description: Name Node for Hadoop
 The Hadoop Distributed Filesystem (HDFS) requires one unique server,
 the namenode, which manages the block locations of files on the
 filesystem.

Package: hadoop-0.20-native
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 252
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1), libc6 (>= 2.4), liblzo2-2, libz1
Enhances: hadoop-0.20
Provides: hadoop-native
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~hardy-cdh3b1_i386.deb
Size: 164098
SHA256: 4015305a7061223a9e4f280936df786c41ca66cef5af0ba8ab590e3a11b5822c
SHA1: 548fe9f334ec3694ac4905bd992f8455f2abac7d
MD5sum: febe2466c07e43dbd95f9eabf5fede45
Description: Native libraries for Hadoop (e.g., compression)
 This optional package contains native libraries that increase the
 performance of Hadoop's compression.

Package: hadoop-0.20-pipes
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 432
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-pipes
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~hardy-cdh3b1_i386.deb
Size: 218132
SHA256: 99855c1bf40a804928d0364372b66ead590d76c23e7bed2abb5c873c6fe342eb
SHA1: 91155a84da36cd8a0f81ba14a2fe83e5b1ab052e
MD5sum: ca4389c49ae1e969a4d965a7e5804b00
Description: Interface to author Hadoop MapReduce jobs in C++
 Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to
 be written in C++.

Package: hadoop-0.20-secondarynamenode
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-secondarynamenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 155832
SHA256: a3d3b0cd739609d1a685fae4735113771ded61ea08bd928713cb52b738624f22
SHA1: ea03ddaadb8df3fff7cc351569bb196f08895d67
MD5sum: b32418814804e5832c865d123221ed85
Description: Secondary Name Node for Hadoop
 The Secondary Name Node is responsible for checkpointing file system
 images. It is _not_ a failover pair for the namenode, and may safely
 be run on the same machine.

Package: hadoop-0.20-source
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 52968
Provides: hadoop-source
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 16894716
SHA256: 5bc3b7b285e60b52f1611589349cb398e20d85bbaed316bd507cc9c878f3473a
SHA1: cee8f045520072946889ecd00e3cfe540bf87eed
MD5sum: b14cecf50fcd946726daa3ae6d1ad172
Description: Source code for Hadoop
 This package contains the source code for Hadoop and its contrib
 modules.

Package: hadoop-0.20-tasktracker
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1)
Provides: hadoop-tasktracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~hardy-cdh3b1_all.deb
Size: 155810
SHA256: d0a59443918022a015fca226b51d3698a9e4b99c2b450ac2ba2fbb6361ec8407
SHA1: 75c3a498e782613df78c267737967593b8ca2e17
MD5sum: b8b7fedbc37331f45500d1a7e10e7803
Description: Task Tracker for Hadoop
 The Task Tracker is the Hadoop service that accepts MapReduce tasks
 and computes results. Each node in a Hadoop cluster that should be
 doing computation should run a Task Tracker.

Package: hadoop-hive
Version: 0.5.0+20-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 13132
Depends: hadoop, sun-java6-jre | sun-java6-sdk
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~hardy-cdh3b1_all.deb
Size: 11109988
SHA256: a7cdc59dc8b0ed9eb8133eda1d0c1d64a311e62ab9147cf56de5b8a72ef1d21f
SHA1: cc68b1b803355dc8a2af41bad131d5c6624c7406
MD5sum: b9e7084f051962997ef7937abcb0a31e
Description: A data warehouse infrastructure built on top of Hadoop
 Hive is a data warehouse infrastructure built on top of Hadoop that
 provides tools to enable easy data summarization, ad hoc querying and
 analysis of large datasets stored in Hadoop files. It provides a
 mechanism to put structure on this data and it also provides a simple
 query language called Hive QL which is based on SQL and which enables
 users familiar with SQL to query this data. At the same time, this
 language also allows traditional map/reduce programmers to be able to
 plug in their custom mappers and reducers to do more sophisticated
 analysis which may not be supported by the built-in capabilities of
 the language.

Package: hadoop-pig
Version: 0.5.0+30-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 79756
Depends: hadoop, sun-java6-jre | sun-java6-sdk
Homepage: http://hadoop.apache.org/pig/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~hardy-cdh3b1_all.deb
Size: 30154696
SHA256: 7780979d615e61809344497bbde3c1220f38dde5009d924770649f0c2c602c13
SHA1: b403590cf9d3757e7bc26cf021188217c32ea400
MD5sum: c95576c766359c86be8fa812a7c94a8e
Description: A platform for analyzing large data sets using Hadoop
 Pig is a platform for analyzing large data sets that consists of a
 high-level language for expressing data analysis programs, coupled
 with infrastructure for evaluating these programs. The salient
 property of Pig programs is that their structure is amenable to
 substantial parallelization, which in turn enables them to handle
 very large data sets.
 .
 At the present time, Pig's infrastructure layer consists of a compiler
 that produces sequences of Map-Reduce programs, for which large-scale
 parallel implementations already exist (e.g., the Hadoop subproject).
 Pig's language layer currently consists of a textual language called
 Pig Latin, which has the following key properties:
 .
 * Ease of programming
    It is trivial to achieve parallel execution of simple,
    "embarrassingly parallel" data analysis tasks. Complex tasks
    comprised of multiple interrelated data transformations are
    explicitly encoded as data flow sequences, making them easy to
    write, understand, and maintain.
 * Optimization opportunities
    The way in which tasks are encoded permits the system to optimize
    their execution automatically, allowing the user to focus on
    semantics rather than efficiency.
 * Extensibility
    Users can create their own functions to do special-purpose
    processing.

Package: libhdfs0
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 236
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1), libc6 (>= 2.4)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~hardy-cdh3b1_i386.deb
Size: 170042
SHA256: bfa78cc4c6b4c95294f31b2cd3317d9690b50e3429bcac7549745e9f190df9d5
SHA1: e05c916c3a7bfc7edd692d702e5075e42dd74c8f
MD5sum: 716cd31aec462ae90011823a5a013a60
Description: JNI Bindings to access Hadoop HDFS from C
 See http://wiki.apache.org/hadoop/LibHDFS

Package: libhdfs0-dev
Source: hadoop-0.20
Version: 0.20.2+228-1~hardy-cdh3b1
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 228
Depends: hadoop-0.20 (= 0.20.2+228-1~hardy-cdh3b1), libhdfs0 (= 0.20.2+228-1~hardy-cdh3b1)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: libdevel
Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~hardy-cdh3b1_i386.deb
Size: 164762
SHA256: 176cbce5e893c920f905a6680b7ad2654be9260484abe43d1b91467961fe0249
SHA1: 412293785ec55e1b474db7476d906b934e9ad2b3
MD5sum: 43a83af3166b109d6b7ebd3c66e18345
Description: Development support for libhdfs0
 Includes examples and header files for accessing HDFS from C

Package: python-hive
Source: hadoop-hive
Version: 0.5.0+20-1~hardy-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 700
Depends: python (>= 2.4), python-support (>= 0.7.1)
Provides: python2.4-hive, python2.5-hive
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: python
Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~hardy-cdh3b1_all.deb
Size: 53750
SHA256: 57cc627815c0b75be638427a8af262b211b672a61d918613068d2f8cfde9545e
SHA1: b000f8819a48f8227ed5d62f1018e1ceac7564c8
MD5sum: 69321bb2aeec68183e9df047562ae573
Description: Python client library to talk to the Hive Metastore
 This is a generated Thrift client to talk to the Hive Metastore.