Package: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 42524
Depends: adduser, sun-java6-jre, sun-java6-bin
Recommends: hadoop-0.20-native
Provides: hadoop
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 22099686
SHA256: 681f55d4b4a2499871cf726f05883d1892f6b8a7a4e00782ef6f14de1b465269
SHA1: 65c31bafa20c4a9a0d76503c57280fecae255425
MD5sum: ece63869123511b03139f0a258d00a2a
Description: A software platform for processing vast amounts of data
 Hadoop is a software platform that lets one easily write and
 run applications that process vast amounts of data.
 .
 Here's what makes Hadoop especially useful:
  * Scalable: Hadoop can reliably store and process petabytes.
  * Economical: It distributes the data and processing across clusters
    of commonly available computers. These clusters can number
    into the thousands of nodes.
  * Efficient: By distributing the data, Hadoop can process it in
    parallel on the nodes where the data is located. This makes it
    extremely rapid.
  * Reliable: Hadoop automatically maintains multiple copies of data
    and automatically redeploys computing tasks based on failures.
 .
 Hadoop implements MapReduce, using the Hadoop Distributed File System
 (HDFS). MapReduce divides applications into many small blocks of work.
 HDFS creates multiple replicas of data blocks for reliability, placing
 them on compute nodes around the cluster. MapReduce can then process
 the data where it is located.

Package: hadoop-0.20-conf-pseudo
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 268
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-conf-pseudo
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 143654
SHA256: decc87b273605457b2a32ff0bc8c4d7f50f9a8410a61f9eee6a6b3c341f364d4
SHA1: 49d2b667616c3d4da7384812205e8ff10047f6dc
MD5sum: 8f3d64f5709368c719e7305cb32a3736
Description: Pseudo-distributed Hadoop configuration
 Contains configuration files for a "pseudo-distributed" Hadoop
 deployment. In this mode, each of the hadoop components runs as a
 separate Java process, but all on the same machine.

Package: hadoop-0.20-datanode
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-datanode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136316
SHA256: cc64284208678476a4e697b22314ff8091361574102965ac0405ee45f2c68629
SHA1: 477f161cf3190f1a146f35f717253041745d4930
MD5sum: fcb38167ce05bd2bc3c84f4859a607dd
Description: Data Node for Hadoop
 The Data Nodes in the Hadoop Cluster are responsible for serving up
 blocks of data over the network to Hadoop Distributed Filesystem
 (HDFS) clients.

Package: hadoop-0.20-doc
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 47288
Provides: hadoop-doc
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: doc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 5734598
SHA256: f4ef99b939fffb12f4c6bdd3a64465873c6591f0a7f1709a8e3e1d57eb8945bf
SHA1: debabbd4a629bb2ae7558f00b88cf9418dec93bc
MD5sum: 29e6f74cad31a8ea90be86f5eabc32b4
Description: Documentation for Hadoop
 This package contains the Java Documentation for Hadoop and its
 relevant APIs.

Package: hadoop-0.20-fuse
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 296
Depends: libc6 (>= 2.7), libfuse2 (>= 2.8.1), libhdfs0, hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), fuse-utils
Enhances: hadoop-0.20
Provides: hadoop-fuse
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-fuse_0.20.1+169.127-1~lucid-cdh2_amd64.deb
Size: 168844
SHA256: beca369eb7c23aed6c0fc740e5eb992b91cfb65cca77113c24db747ac7c5fc55
SHA1: dbbe26dbc297e24098b0b57a33353d2e00a6bea4
MD5sum: d5af1a03db451b15de9b7bf31d3f9342
Description: HDFS exposed over a Filesystem in Userspace
 These projects (enumerated below) allow HDFS to be mounted (on most
 flavors of Unix) as a standard file system using the mount command.
 Once mounted, the user can operate on an instance of HDFS using
 standard Unix utilities such as 'ls', 'cd', 'cp', 'mkdir', 'find',
 'grep', or use standard POSIX calls like open, write, read, close
 from C, C++, Python, Ruby, Perl, Java, bash, etc.

Package: hadoop-0.20-jobtracker
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-jobtracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136354
SHA256: 093c5dda96aee7978d12ca872f908e133aa4d42b3e55ecfde441fb8d12070057
SHA1: 6749bf53ee484403b872cc2fab461b9f0c2f4258
MD5sum: e688f91e2c6695eb17de15fc194e590e
Description: Job Tracker for Hadoop
 The jobtracker is a central service which is responsible for managing
 the tasktracker services running on all nodes in a Hadoop Cluster.
 The jobtracker allocates work to the tasktracker nearest to the data
 with an available work slot.

Package: hadoop-0.20-namenode
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-namenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136314
SHA256: 7034547830d9db02b03ea0db2733900f5d3ee645845e04d1faf6449d4fe92493
SHA1: bb5e11195ffa76cc19c42cc8f734ed2c77b782c8
MD5sum: 06c78574e95c0264dcdcbe96d44ced16
Description: Name Node for Hadoop
 The Hadoop Distributed Filesystem (HDFS) requires one unique server,
 the namenode, which manages the block locations of files on the
 filesystem.

Package: hadoop-0.20-native
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 256
Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), liblzo2-2, libz1
Enhances: hadoop-0.20
Provides: hadoop-native
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.1+169.127-1~lucid-cdh2_amd64.deb
Size: 144884
SHA256: 8da61a4e9cbfc9cf8807b3f818266237b796fa1dc7bf05ff8f8d239c519768b1
SHA1: e0fbe1d772be73dfca438ef7b5e591fb5b23069c
MD5sum: 8f5791138b2fcb9a956bfb1b44758d88
Description: Native libraries for Hadoop (e.g., compression)
 This optional package contains native libraries that increase the
 performance of Hadoop's compression.

Package: hadoop-0.20-pipes
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 456
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-pipes
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.1+169.127-1~lucid-cdh2_amd64.deb
Size: 195516
SHA256: abc59bff9f2fdb5c33c9bf306089d5de6dd9cf92210e9775d4c3b14184d89d8d
SHA1: d8b872c14234231725d4dd3cd1b212674d608afe
MD5sum: 69f2c6cb4437177edde9a4d65fd4b74a
Description: Interface to author Hadoop MapReduce jobs in C++
 Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs
 to be written in C++.

Package: hadoop-0.20-secondarynamenode
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-secondarynamenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136352
SHA256: 018d9464a1f03b18ae3c16f9fa6467a786ce47946b89f5fa3e174fecf0d76a30
SHA1: dfaf45ef2525cb2dfb26ebedb6b964297d1c8ff6
MD5sum: a26d7ea5df0638da5b126beee919e0e4
Description: Secondary Name Node for Hadoop
 The Secondary Name Node is responsible for checkpointing file system
 images. It is _not_ a failover pair for the namenode, and may safely
 be run on the same machine.

Package: hadoop-0.20-source
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 52264
Provides: hadoop-source
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 16711932
SHA256: 3857c520710d4080725fa999a213305047c7f5a4a5133db1feccba37f7504594
SHA1: bfce5ae097dad379b5ee66dd39ae8db5df62cc29
MD5sum: 3103716deeddacdff37007a7f216a972
Description: Source code for Hadoop
 This package contains the source code for Hadoop and its contrib
 modules.

Package: hadoop-0.20-tasktracker
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-tasktracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136332
SHA256: 2582a6f96133d78a3dafceb59b509d0083ceef6153666eeaae16a72015cc8bcf
SHA1: b351ed6eeaf7d55b02c69b4e092bd9fb9bdde9e2
MD5sum: f9722b1cd973c84abaad79241cc9b672
Description: Task Tracker for Hadoop
 The Task Tracker is the Hadoop service that accepts MapReduce tasks
 and computes results. Each node in a Hadoop cluster that should be
 doing computation should run a Task Tracker.

Package: hadoop-hive
Version: 0.4.1+14.5-2~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 12324
Depends: sun-java6-jre | sun-java6-sdk, hadoop
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.4.1+14.5-2~lucid-cdh2_all.deb
Size: 10741050
SHA256: 6fb3e1ab1aeefdcbb8f775eb11139caae37ef6c89d5e9cb8129c30a76d781dbc
SHA1: 1689898215d99160bf6cbeef67d350d02c14c1af
MD5sum: 23712dc5108d8e73c38b379f26eb286e
Description: A data warehouse infrastructure built on top of Hadoop
 Hive is a data warehouse infrastructure built on top of Hadoop that
 provides tools to enable easy data summarization, ad hoc querying and
 analysis of large datasets stored in Hadoop files. It provides a
 mechanism to put structure on this data and it also provides a simple
 query language called Hive QL which is based on SQL and which enables
 users familiar with SQL to query this data. At the same time, this
 language also allows traditional map/reduce programmers to plug in
 their custom mappers and reducers to do more sophisticated analysis
 which may not be supported by the built-in capabilities of the
 language.

Package: hadoop-pig
Version: 0.5.0+11.19-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 79764
Depends: sun-java6-jre | sun-java6-sdk, hadoop
Homepage: http://hadoop.apache.org/pig/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+11.19-1~lucid-cdh2_all.deb
Size: 30163068
SHA256: 8e34e621fe0f3835ea7033173171475cae4173333248f3dcaaf1468a7b0445e0
SHA1: a43f37ddbda428b5bb91c74ca14f16602fd9ce06
MD5sum: a832c65cb91b888ffd5e0e26e7a004f7
Description: A platform for analyzing large data sets using Hadoop
 Pig is a platform for analyzing large data sets that consists of a
 high-level language for expressing data analysis programs, coupled
 with infrastructure for evaluating these programs. The salient
 property of Pig programs is that their structure is amenable to
 substantial parallelization, which in turn enables them to handle
 very large data sets.
 .
 At the present time, Pig's infrastructure layer consists of a compiler
 that produces sequences of Map-Reduce programs, for which large-scale
 parallel implementations already exist (e.g., the Hadoop subproject).
 Pig's language layer currently consists of a textual language called
 Pig Latin, which has the following key properties:
 .
  * Ease of programming
    It is trivial to achieve parallel execution of simple,
    "embarrassingly parallel" data analysis tasks. Complex tasks
    comprised of multiple interrelated data transformations are
    explicitly encoded as data flow sequences, making them easy to
    write, understand, and maintain.
  * Optimization opportunities
    The way in which tasks are encoded permits the system to optimize
    their execution automatically, allowing the user to focus on
    semantics rather than efficiency.
  * Extensibility
    Users can create their own functions to do special-purpose
    processing.

Package: libhdfs0
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 224
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), libc6 (>= 2.4)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.1+169.127-1~lucid-cdh2_amd64.deb
Size: 151644
SHA256: a1eacc9fb5d90167a519b4c34c0d5ad4861f3e8c9cb4c03af37727257a045e5b
SHA1: 4da2a01a3c3acfc569c997744444d3434adc997e
MD5sum: ea0fe818f3636ff9d117dd4f278c8c08
Description: JNI Bindings to access Hadoop HDFS from C
 See http://wiki.apache.org/hadoop/LibHDFS

Package: libhdfs0-dev
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: amd64
Maintainer: Todd Lipcon
Installed-Size: 212
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), libhdfs0 (= 0.20.1+169.127-1~lucid-cdh2)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: libdevel
Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.1+169.127-1~lucid-cdh2_amd64.deb
Size: 145720
SHA256: 6c7c85a077903dd6aa6810fa8e7536f110026bd7b529c26181e2bf5b9227e2ed
SHA1: f42259242ac98f8dfd6b628c9fb4466cb068ff6a
MD5sum: d0cb5536ccc09489c08a6c0c749119a9
Description: Development support for libhdfs0
 Includes examples and header files for accessing HDFS from C

Package: python-hive
Source: hadoop-hive
Version: 0.4.1+14.5-2~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 636
Depends: python (>= 2.4), python-support (>= 0.90.0)
Provides: python2.6-hive
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: python
Filename: pool/contrib/h/hadoop-hive/python-hive_0.4.1+14.5-2~lucid-cdh2_all.deb
Size: 49040
SHA256: 0b8ae5ba911f72869615b8e60364239249c97a1565ac2b3cbf357d04d09f34b8
SHA1: d40071df47c766bd9d8b92da7d056336333bb687
MD5sum: 104fed7cab0139788a71e3587ee65869
Description: Python client library to talk to the Hive Metastore
 This is a generated Thrift client to talk to the Hive Metastore.