Package: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 44332
Depends: adduser, sun-java6-jre, sun-java6-bin
Recommends: hadoop-0.20-native
Provides: hadoop
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 22271618
SHA256: f2ee9c12f375c6e692cb789228ac3a368190863a43cc74c1b8785728d051efe4
SHA1: 245db72832f9715b79346b661e990248a40a52cb
MD5sum: 100f20c0a07f398452431e09e39cb5cb
Description: A software platform for processing vast amounts of data
 Hadoop is a software platform that lets one easily write and run
 applications that process vast amounts of data.
 .
 Here's what makes Hadoop especially useful:
 * Scalable: Hadoop can reliably store and process petabytes.
 * Economical: It distributes the data and processing across clusters
   of commonly available computers. These clusters can number into
   the thousands of nodes.
 * Efficient: By distributing the data, Hadoop can process it in
   parallel on the nodes where the data is located. This makes it
   extremely rapid.
 * Reliable: Hadoop automatically maintains multiple copies of data
   and automatically redeploys computing tasks based on failures.
 .
 Hadoop implements MapReduce, using the Hadoop Distributed File System
 (HDFS). MapReduce divides applications into many small blocks of work.
 HDFS creates multiple replicas of data blocks for reliability, placing
 them on compute nodes around the cluster. MapReduce can then process
 the data where it is located.

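The description above says that MapReduce divides an application into many small blocks of work. A minimal in-process sketch of that idea in Python follows. It is not Hadoop's API: the function names (`map_words`, `shuffle`, `reduce_counts`, `word_count`) are hypothetical and chosen only for illustration, and the blocks are processed sequentially rather than on separate nodes.

```python
from collections import defaultdict


def map_words(block):
    """Map step: emit a (word, 1) pair for every word in one block of input."""
    return [(word.lower(), 1) for word in block.split()]


def shuffle(pairs):
    """Group intermediate values by key, as a framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(key, values):
    """Reduce step: combine all values seen for one key."""
    return key, sum(values)


def word_count(blocks):
    """Run map, shuffle, and reduce over independent input blocks."""
    intermediate = []
    for block in blocks:  # each block could be mapped on a different node
        intermediate.extend(map_words(block))
    return dict(reduce_counts(k, v) for k, v in shuffle(intermediate).items())
```

Because each call to `map_words` depends only on its own block, the map phase could run wherever that block is stored, which is the property the description attributes to running MapReduce on top of HDFS.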
Package: hadoop-0.20-conf-pseudo
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 284
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-conf-pseudo
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 163062
SHA256: 2ca6822a66616f782defab7caf13e022d004f4d28a001e16ed59b2ebd30b5298
SHA1: 10181be609934d4266490406fd28801d2b8eb6d0
MD5sum: 6483905133140c8f8dbec5825a7b11d2
Description: Pseudo-distributed Hadoop configuration
 Contains configuration files for a "pseudo-distributed" Hadoop
 deployment. In this mode, each of the hadoop components runs as a
 separate Java process, but all on the same machine.

Package: hadoop-0.20-datanode
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-datanode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 155862
SHA256: d8a8793143494b94312379be8ab434d753f5b1e355a377c1a2d10fa1a881332d
SHA1: 0ac2f669758b3aa775f6b3d82a2fdd7b0e825b97
MD5sum: 40934590c8f717a0cdd0b7341005998b
Description: Data Node for Hadoop
 The Data Nodes in the Hadoop Cluster are responsible for serving up
 blocks of data over the network to Hadoop Distributed Filesystem
 (HDFS) clients.

Package: hadoop-0.20-doc
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 48456
Provides: hadoop-doc
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: doc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 5830686
SHA256: 119e270340dbd7a22d55c8286483d48b66e28876191d1432bfd8ca427643997c
SHA1: 7dc779a7e517a72ee323667675d08c51305a226c
MD5sum: e238f23fa16cf641bf01da939153ccf3
Description: Documentation for Hadoop
 This package contains the Java Documentation for Hadoop and its
 relevant APIs.

Package: hadoop-0.20-jobtracker
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-jobtracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 155902
SHA256: f279b1f21b6d37462729a16f3daf139bdc5bd0fde5dfe52104db87c620e6957b
SHA1: cce496fa10a944747c972cd06e39b0ff47c373a3
MD5sum: ea3fa8b867c70baec5b888cdf4aeb661
Description: Job Tracker for Hadoop
 The jobtracker is a central service which is responsible for managing
 the tasktracker services running on all nodes in a Hadoop Cluster.
 The jobtracker allocates work to the tasktracker nearest to the data
 with an available work slot.

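The jobtracker description above states a placement rule: give work to the tasktracker nearest to the data that still has a free slot. The Python sketch below illustrates that rule only; it is not how the jobtracker is implemented. The three locality levels, the function names (`distance`, `pick_tracker`), and the `rack_of` host-to-rack mapping are all assumptions introduced for this example.

```python
# Locality levels, nearest first. This three-level model is an assumption.
NODE_LOCAL, RACK_LOCAL, OFF_RACK = 0, 1, 2


def distance(tracker, replica_hosts, rack_of):
    """Return how far a tracker is from the closest replica of the data."""
    if tracker in replica_hosts:
        return NODE_LOCAL
    replica_racks = {rack_of[host] for host in replica_hosts}
    if rack_of[tracker] in replica_racks:
        return RACK_LOCAL
    return OFF_RACK


def pick_tracker(replica_hosts, free_slots, rack_of):
    """Choose the nearest tracker that still has a free work slot, or None."""
    candidates = [t for t, slots in free_slots.items() if slots > 0]
    if not candidates:
        return None
    # Ties on distance are broken by host name to keep the choice deterministic.
    return min(candidates, key=lambda t: (distance(t, replica_hosts, rack_of), t))
```

For example, if the only replica lives on a host whose slots are all busy, `pick_tracker` falls back to a host on the same rack before considering one on another rack.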
Package: hadoop-0.20-namenode
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-namenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 155864
SHA256: 8e5c03c3d114a5c4a450cf8077cec80958c88d893d1810995f090f6f519f66ca
SHA1: 01b9ebdc80d0d2a4b876a1e5cdcff1be6954e068
MD5sum: 0070f7871c3c759bab06290e00418fca
Description: Name Node for Hadoop
 The Hadoop Distributed Filesystem (HDFS) requires one unique server,
 the namenode, which manages the block locations of files on the
 filesystem.

Package: hadoop-0.20-native
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 272
Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1), liblzo2-2, libz1
Enhances: hadoop-0.20
Provides: hadoop-native
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~jaunty-cdh3b1_amd64.deb
Size: 165084
SHA256: b0ef63d52e17ef9b9de639958074779e7df8ca180f63a83abdd79d66720ca242
SHA1: 1b05f9934762bc2b35958591cfc867c3976ba72a
MD5sum: e31e7f6aff6d90147f15c9491cb994ee
Description: Native libraries for Hadoop (e.g., compression)
 This optional package contains native libraries that increase the
 performance of Hadoop's compression.

Package: hadoop-0.20-pipes
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 476
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-pipes
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~jaunty-cdh3b1_amd64.deb
Size: 216504
SHA256: 38ae8044e0fdb043d8b068bb3b3ffe654ec542558ac38d4af7c99ee5a33dc767
SHA1: f11e5cdea60708c2abd08cc2867ad50b629e4117
MD5sum: 14fcde47ac88279213c96734a3aaa269
Description: Interface to author Hadoop MapReduce jobs in C++
 Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to
 be written in C++.

Package: hadoop-0.20-secondarynamenode
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-secondarynamenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 155902
SHA256: 331c05a0e4cc4725323babb87b7abf289889bd1ba9fd7536e9db65365e87051d
SHA1: ecab325d23fad74c9d76ec352be27dae3e834cd7
MD5sum: 3007c1be824ffba5531cd4503832983e
Description: Secondary Name Node for Hadoop
 The Secondary Name Node is responsible for checkpointing file system
 images. It is _not_ a failover pair for the namenode, and may safely be
 run on the same machine.

Package: hadoop-0.20-source
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 53000
Provides: hadoop-source
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 16903578
SHA256: ca4921aded32383a37302bd336bc33fd68011b27a1b1ab7a10babce542b2d0c0
SHA1: 3ccf478c7b83dccff5ff006abcea7e363de628bf
MD5sum: 1597b311de7760655eee25fd72a29230
Description: Source code for Hadoop
 This package contains the source code for Hadoop and its contrib
 modules.

Package: hadoop-0.20-tasktracker
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 204
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1)
Provides: hadoop-tasktracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~jaunty-cdh3b1_all.deb
Size: 155872
SHA256: b3b49a25bbd32bd9f15fdb55929daa7035a0980db7580eee494a385b0014918e
SHA1: 0d205e24b5a767458892cfd1b49d01cd9cb54f27
MD5sum: f26d9d6bb3a1e4d7e024d4233c1793dc
Description: Task Tracker for Hadoop
 The Task Tracker is the Hadoop service that accepts MapReduce tasks and
 computes results. Each node in a Hadoop cluster that should be doing
 computation should run a Task Tracker.

Package: hadoop-hive
Version: 0.5.0+20-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 13132
Depends: sun-java6-jre | sun-java6-sdk, hadoop
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~jaunty-cdh3b1_all.deb
Size: 11110368
SHA256: 6829d426bc1ae8100c7ecc29f4b80c721830da6db92d66064d1e2eab7b8cd018
SHA1: cf10affc0ac9aa1ac5e75f612e3489efbab58c46
MD5sum: 4eed163f350c72a7017b233094454c9b
Description: A data warehouse infrastructure built on top of Hadoop
 Hive is a data warehouse infrastructure built on top of Hadoop that
 provides tools to enable easy data summarization, ad hoc querying, and
 analysis of large datasets stored in Hadoop files. It provides a
 mechanism to put structure on this data, and it also provides a simple
 query language called Hive QL, which is based on SQL and enables
 users familiar with SQL to query this data. At the same time, this
 language also allows traditional map/reduce programmers to plug in
 their custom mappers and reducers to do more sophisticated analysis
 which may not be supported by the built-in capabilities of the language.

Package: hadoop-pig
Version: 0.5.0+30-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 79756
Depends: sun-java6-jre | sun-java6-sdk, hadoop
Homepage: http://hadoop.apache.org/pig/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~jaunty-cdh3b1_all.deb
Size: 30156188
SHA256: 2fad50c9bccac5bf20860f59e076878622fa9d5ce71fb6138ea352ef3f5df03f
SHA1: 44b2d2b646c5eb8f4edc38549114659048ed969a
MD5sum: cdefebe16d6067f41e28ebbb18c64ecb
Description: A platform for analyzing large data sets using Hadoop
 Pig is a platform for analyzing large data sets that consists of a
 high-level language for expressing data analysis programs, coupled with
 infrastructure for evaluating these programs. The salient property of
 Pig programs is that their structure is amenable to substantial
 parallelization, which in turn enables them to handle very large data
 sets.
 .
 At the present time, Pig's infrastructure layer consists of a compiler
 that produces sequences of Map-Reduce programs, for which large-scale
 parallel implementations already exist (e.g., the Hadoop subproject).
 Pig's language layer currently consists of a textual language called
 Pig Latin, which has the following key properties:
 .
 * Ease of programming. It is trivial to achieve parallel execution of
   simple, "embarrassingly parallel" data analysis tasks. Complex tasks
   comprised of multiple interrelated data transformations are
   explicitly encoded as data flow sequences, making them easy to
   write, understand, and maintain.
 * Optimization opportunities. The way in which tasks are encoded
   permits the system to optimize their execution automatically,
   allowing the user to focus on semantics rather than efficiency.
 * Extensibility. Users can create their own functions to do
   special-purpose processing.

Package: libhdfs0
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 240
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1), libc6 (>= 2.4)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~jaunty-cdh3b1_amd64.deb
Size: 171332
SHA256: 6e007bd7c2893b1a9cc499cec854a7c518c2801dca501d8359c109d60c835dfc
SHA1: 07483ffc4e17fac0c694c2f94d920f279c03355b
MD5sum: 7d1a6551c7b9f19393dad2ef0806d2a9
Description: JNI Bindings to access Hadoop HDFS from C
 See http://wiki.apache.org/hadoop/LibHDFS

Package: libhdfs0-dev
Source: hadoop-0.20
Version: 0.20.2+228-1~jaunty-cdh3b1
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 228
Depends: hadoop-0.20 (= 0.20.2+228-1~jaunty-cdh3b1), libhdfs0 (= 0.20.2+228-1~jaunty-cdh3b1)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: libdevel
Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~jaunty-cdh3b1_amd64.deb
Size: 165472
SHA256: 988e941d374f9649c10c91dc7a247e3ed37cb535d2841c0f2d6b41df62174804
SHA1: 276a24fe5e4a5c30fa77adf6061617dd3386c77a
MD5sum: 9683d3312b2ce78d96ce8629ad802017
Description: Development support for libhdfs0
 Includes examples and header files for accessing HDFS from C

Package: python-hive
Source: hadoop-hive
Version: 0.5.0+20-1~jaunty-cdh3b1
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 700
Depends: python (>= 2.4), python-support (>= 0.7.1)
Provides: python2.5-hive, python2.6-hive
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: python
Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~jaunty-cdh3b1_all.deb
Size: 54192
SHA256: fc8013064d5468c2a697d5d89c34c51c8ac714faaba962b7a7164866a5ead227
SHA1: 0220e21c579c9f7af0dde8c8ab5e5ae8cbe17612
MD5sum: ef782f8ade61ee8b8e969517a6aa574c
Description: Python client library to talk to the Hive Metastore
 This is a generated Thrift client to talk to the Hive Metastore.
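Every stanza in this index carries `Filename`, `Size`, and `SHA256` fields, which a client can use to check a downloaded `.deb`. The Python sketch below shows one way to do that, assuming the stanza is laid out in the usual Debian control format (one `Field: value` per line, with continuation lines beginning with a space or tab). The helper names `parse_stanza` and `verify_package` are hypothetical, not part of any APT tooling.

```python
import hashlib


def parse_stanza(text):
    """Parse one control stanza into a dict of field name -> value.

    Continuation lines (those starting with a space or tab) are appended
    to the preceding field, separated by newlines.
    """
    fields = {}
    current = None
    for line in text.splitlines():
        if not line.strip():
            continue  # ignore blank separator lines
        if line[0] in " \t" and current is not None:
            fields[current] += "\n" + line[1:]
        else:
            name, _, value = line.partition(":")
            current = name.strip()
            fields[current] = value.strip()
    return fields


def verify_package(data, fields):
    """Check downloaded package bytes against the Size and SHA256 fields."""
    if len(data) != int(fields["Size"]):
        return False
    return hashlib.sha256(data).hexdigest() == fields["SHA256"]
```

Checking the size first is a cheap way to reject a truncated download before hashing. SHA256 is used here in preference to the `SHA1` and `MD5sum` fields because it is the strongest digest the index provides.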