Package: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 44332 Depends: adduser, sun-java6-jre, sun-java6-bin Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 22276764 SHA256: a0b85dbbc0abc9a143e7858e2e7243ded0e219d15e19b4a2587cd306a8762868 SHA1: 51206ff99ec2643c8c3c096a246ab0c4d75da09e MD5sum: 5fed695be5b4e7ebf30576320611bba2 Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 284 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 163210 SHA256: d0ef10cf36cc9678a4f29ea47a7deff8e4172e31b414a5f74410206ea80d861b SHA1: 8270e83da3c67e3eb178a01d7f902be091ad62c0 MD5sum: 18590a53f395fa6234c3c25c48a3a60a Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155892 SHA256: 2c60532b5df06193059a3103a3426b3ca474ec7191eb9f80994950733ceb7efd SHA1: 9e73fa9ac15f8733af2b87d46facca642e1fab3a MD5sum: af9635f9d601e62d502a36ac07888b58 Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 48456 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 5831858 SHA256: 0e5ace2af33afc2856dfa7e1dc5d97d6fe6eb62374a5ad4e1abe373daab3f80c SHA1: cea0c73c23cc796671fb1ee1569362ef9b489b3a MD5sum: fdb713f257324f1d87c727b00af67b57 Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155930 SHA256: d39249c90b8b96a6dd1c87995bb11215aed357d5d09692cce5e4faf1a81d5b24 SHA1: 530d469e18790f540414ab8c48d5227a94d20c59 MD5sum: d100da3501d20ca149ba5ec39c71d884 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155886 SHA256: 5e8b2b9b9ecf9e6c45c87144017c8d3e9117a948ea77b041269661262ee25c52 SHA1: e6ee83cf8418d45a3b1405b57becc01fce77f62d MD5sum: 5b5b8edacc29da91952d94cf9a5bae54 Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 272 Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~karmic-cdh3b1_amd64.deb Size: 164814 SHA256: 7c8105c130a3bc5b889fbecaa90c9792b8c4aeaacf816a14b3cae6507e4b08d3 SHA1: 267359cd45dc9f7848eaea970a45f0e07054b823 MD5sum: 179bc3d0ee96ec3b3e0d1145761bce4f Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 472 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~karmic-cdh3b1_amd64.deb Size: 215860 SHA256: a1d8d419ff3f8909e8b0f74cbabcf771b298e6bcf93fb34b4f8a686cbe3e2d93 SHA1: 6b7dcd46cf5eb43413c8f58556d02cc25250762c MD5sum: 1fa61698b8ae3a266c58e89e94d3e67d Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155932 SHA256: a629c8c7e5825786a1351d01627c4b9a5ea08534a6f9b7fadaf56e54f0045e11 SHA1: bf7aa560af70e356d2b1457c6dcb62cfd39b2700 MD5sum: 76b360a28670b638a3396aeeca808822 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 52848 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 16902912 SHA256: f2d5296fdc692f3c1ef4491f87bc508d379ddd968aa92b449f80775d1a613133 SHA1: 9e35430adbf5df6b4e977c0d6cbed1d036569d3b MD5sum: 8d36d95fc7ab31004e9042a982c8d927 Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155904 SHA256: ac688124744b57ef5226a02037bb7d7442181ed61936193a0a1367fb15312f4f SHA1: 2b71daa521b8e19e4d3989043122594e85d9fa40 MD5sum: c85665bc07edf102696f009752410fa2 Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hive Version: 0.5.0+20-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 13132 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~karmic-cdh3b1_all.deb Size: 11109004 SHA256: 4a12a2429194d45a6eb721cb77ab2f55571142ba67ddf50de004731f9f7d0c7f SHA1: 46da18896af24c3283d7a78a1268d292aae6866a MD5sum: b0c76c64e4cb2b489ebe92d20e36f4ea Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.5.0+30-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 79756 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~karmic-cdh3b1_all.deb Size: 30156006 SHA256: af5d4a10a0ca93d232ddd99a76da566d713d693abd7fe50f747b1d98b7cf7633 SHA1: 803e96102754d30b890147f34902caf28956449b MD5sum: a28ba2f603b514451270a84661e19252 Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 240 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1), libc6 (>= 2.4) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~karmic-cdh3b1_amd64.deb Size: 171274 SHA256: 7c113e42ea101cb3d6b0948de1af1437144c6131b49d5462291428e881904404 SHA1: b950a3a1f3d80db9a0cec353b8b65a28132bf3d4 MD5sum: 58312b584e5bfdfd2f7ed0ba5108d1b6 Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 228 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1), libhdfs0 (= 0.20.2+228-1~karmic-cdh3b1) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~karmic-cdh3b1_amd64.deb Size: 164640 SHA256: 541d42effe08462b82b3c1e63fdf23354e2a34c841a29a1a3a7536d57058cd4b SHA1: 68fce5e0c36fa1ae26fb67da7f3ba8bb6baddc28 MD5sum: a56f6f2a5e639e32c28397e3564511ae Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: python-hive Source: hadoop-hive Version: 0.5.0+20-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 700 Depends: python (>= 2.4), python-support (>= 0.90.0) Provides: python2.5-hive, python2.6-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~karmic-cdh3b1_all.deb Size: 54842 SHA256: 08647ea0567cf2560a813b70ccf9088994af895544f3569820932d466df5ed91 SHA1: e9547a715e62554039a84dae37803de502fb747d MD5sum: e99d01cbf3195ad2d3140f793988782b Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore.