Package: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 42524
Depends: adduser, sun-java6-jre, sun-java6-bin
Recommends: hadoop-0.20-native
Provides: hadoop
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 22099686
SHA256: 681f55d4b4a2499871cf726f05883d1892f6b8a7a4e00782ef6f14de1b465269
SHA1: 65c31bafa20c4a9a0d76503c57280fecae255425
MD5sum: ece63869123511b03139f0a258d00a2a
Description: A software platform for processing vast amounts of data
 Hadoop is a software platform that lets one easily write and run applications
 that process vast amounts of data.
 .
 Here's what makes Hadoop especially useful:
 * Scalable: Hadoop can reliably store and process petabytes.
 * Economical: It distributes the data and processing across clusters of
   commonly available computers. These clusters can number into the thousands
   of nodes.
 * Efficient: By distributing the data, Hadoop can process it in parallel on
   the nodes where the data is located. This makes it extremely rapid.
 * Reliable: Hadoop automatically maintains multiple copies of data and
   automatically redeploys computing tasks based on failures.
 .
 Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS).
 MapReduce divides applications into many small blocks of work. HDFS creates
 multiple replicas of data blocks for reliability, placing them on compute
 nodes around the cluster. MapReduce can then process the data where it is
 located.

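The hadoop-0.20 description says that MapReduce divides an application into many small blocks of work. A minimal single-process sketch of that idea is a word count, shown below. It is only an illustration and does not use the Hadoop API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and each input string stands in for one block of data.

```python
from collections import defaultdict


def map_phase(block):
    """Map step: emit a (word, 1) pair for every word in one block of text."""
    return [(word.lower(), 1) for word in block.split()]


def shuffle(pairs):
    """Group the emitted values by key, as happens between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: sum the counts collected for a single word."""
    return key, sum(values)


def word_count(blocks):
    """Run the map step on each block independently, then reduce per word."""
    pairs = []
    for block in blocks:  # on a real cluster, each block is a separate task
        pairs.extend(map_phase(block))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

Because each call to `map_phase` depends only on its own block, the map calls could run on different machines, which is the property the description relies on.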
Package: hadoop-0.20-conf-pseudo
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 268
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-conf-pseudo
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 143654
SHA256: decc87b273605457b2a32ff0bc8c4d7f50f9a8410a61f9eee6a6b3c341f364d4
SHA1: 49d2b667616c3d4da7384812205e8ff10047f6dc
MD5sum: 8f3d64f5709368c719e7305cb32a3736
Description: Pseudo-distributed Hadoop configuration
 Contains configuration files for a "pseudo-distributed" Hadoop deployment.
 In this mode, each of the Hadoop components runs as a separate Java process,
 but all on the same machine.

Package: hadoop-0.20-datanode
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-datanode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136316
SHA256: cc64284208678476a4e697b22314ff8091361574102965ac0405ee45f2c68629
SHA1: 477f161cf3190f1a146f35f717253041745d4930
MD5sum: fcb38167ce05bd2bc3c84f4859a607dd
Description: Data Node for Hadoop
 The Data Nodes in the Hadoop Cluster are responsible for serving up blocks
 of data over the network to Hadoop Distributed Filesystem (HDFS) clients.

Package: hadoop-0.20-doc
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 47288
Provides: hadoop-doc
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: doc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 5734598
SHA256: f4ef99b939fffb12f4c6bdd3a64465873c6591f0a7f1709a8e3e1d57eb8945bf
SHA1: debabbd4a629bb2ae7558f00b88cf9418dec93bc
MD5sum: 29e6f74cad31a8ea90be86f5eabc32b4
Description: Documentation for Hadoop
 This package contains the Java Documentation for Hadoop and its relevant
 APIs.

Package: hadoop-0.20-fuse
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 296
Depends: libc6 (>= 2.7), libfuse2 (>= 2.8.1), libhdfs0, hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), fuse-utils
Enhances: hadoop-0.20
Provides: hadoop-fuse
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-fuse_0.20.1+169.127-1~lucid-cdh2_i386.deb
Size: 166668
SHA256: c564eecd0fa3945ae3e72a757bf0b5346dd91669a18e1956c385097f6f719f73
SHA1: 264ccd4d042ad56597b07e6ebd574dadcca366fa
MD5sum: ecffdb950b7e965f3238a118929d27c5
Description: HDFS exposed over a Filesystem in Userspace
 This package allows HDFS to be mounted (on most flavors of Unix) as a
 standard file system using the mount command. Once mounted, the user can
 operate on an instance of HDFS using standard Unix utilities such as 'ls',
 'cd', 'cp', 'mkdir', 'find', 'grep', or use standard POSIX library calls
 such as open, write, read, and close from C, C++, Python, Ruby, Perl, Java,
 bash, etc.

Package: hadoop-0.20-jobtracker
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-jobtracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136354
SHA256: 093c5dda96aee7978d12ca872f908e133aa4d42b3e55ecfde441fb8d12070057
SHA1: 6749bf53ee484403b872cc2fab461b9f0c2f4258
MD5sum: e688f91e2c6695eb17de15fc194e590e
Description: Job Tracker for Hadoop
 The jobtracker is a central service which is responsible for managing the
 tasktracker services running on all nodes in a Hadoop Cluster. The
 jobtracker allocates work to the tasktracker nearest to the data with an
 available work slot.

Package: hadoop-0.20-namenode
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-namenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136314
SHA256: 7034547830d9db02b03ea0db2733900f5d3ee645845e04d1faf6449d4fe92493
SHA1: bb5e11195ffa76cc19c42cc8f734ed2c77b782c8
MD5sum: 06c78574e95c0264dcdcbe96d44ced16
Description: Name Node for Hadoop
 The Hadoop Distributed Filesystem (HDFS) requires one unique server, the
 namenode, which manages the block locations of files on the filesystem.

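The jobtracker description says that work goes to the tasktracker nearest to the data that has an available work slot. One way to picture that rule is the sketch below. It is not Hadoop's scheduler code: the function `pick_tasktracker`, its parameters, and the three-level distance (same node, same rack, elsewhere) are assumptions made for illustration.

```python
def pick_tasktracker(replica_hosts, free_slots, rack_of):
    """Choose a host with a free slot, preferring hosts close to the data.

    replica_hosts: hosts that hold a replica of the input block
    free_slots:    mapping of host -> number of free work slots
    rack_of:       mapping of host -> rack name (hypothetical topology)
    Returns the chosen host, or None when no host has a free slot.
    """
    candidates = [host for host, slots in free_slots.items() if slots > 0]
    if not candidates:
        return None

    replica_racks = {rack_of[h] for h in replica_hosts if h in rack_of}

    def distance(host):
        if host in replica_hosts:
            return 0  # the data is on this node
        if rack_of.get(host) in replica_racks:
            return 1  # the data is on another node in the same rack
        return 2      # the data is somewhere else in the cluster

    # Ties are broken by host name so the result is deterministic.
    return min(candidates, key=lambda host: (distance(host), host))
```

A host that holds the block but has no free slot is skipped, so the task falls back to the next-closest host rather than waiting.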
Package: hadoop-0.20-native
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 252
Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), liblzo2-2, libz1
Enhances: hadoop-0.20
Provides: hadoop-native
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.1+169.127-1~lucid-cdh2_i386.deb
Size: 144504
SHA256: 3227ca98f5723da4504d7e6f29d25dd21093fb2b57718f1cfb2218c93676cd71
SHA1: a24d84b9013ad12a9f7dc3d71f074bedba38ce8f
MD5sum: 3fe409058b278832c36cabe2cd604977
Description: Native libraries for Hadoop (e.g., compression)
 This optional package contains native libraries that increase the
 performance of Hadoop's compression.

Package: hadoop-0.20-pipes
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 384
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-pipes
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.1+169.127-1~lucid-cdh2_i386.deb
Size: 192928
SHA256: 75a7c0579b2d853710a37e400decaf5a4b5f9197c27477e12a8d7d5233299ef5
SHA1: 9ae38dff9204cc68572456740ccc14e9e2823d14
MD5sum: a9f36208d09574fba878bf3b765c502c
Description: Interface to author Hadoop MapReduce jobs in C++
 Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be
 written in C++.

Package: hadoop-0.20-secondarynamenode
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-secondarynamenode
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136352
SHA256: 018d9464a1f03b18ae3c16f9fa6467a786ce47946b89f5fa3e174fecf0d76a30
SHA1: dfaf45ef2525cb2dfb26ebedb6b964297d1c8ff6
MD5sum: a26d7ea5df0638da5b126beee919e0e4
Description: Secondary Name Node for Hadoop
 The Secondary Name Node is responsible for checkpointing file system images.
 It is _not_ a failover pair for the namenode, and may safely be run on the
 same machine.

Package: hadoop-0.20-source
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 52264
Provides: hadoop-source
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 16711932
SHA256: 3857c520710d4080725fa999a213305047c7f5a4a5133db1feccba37f7504594
SHA1: bfce5ae097dad379b5ee66dd39ae8db5df62cc29
MD5sum: 3103716deeddacdff37007a7f216a972
Description: Source code for Hadoop
 This package contains the source code for Hadoop and its contrib modules.

Package: hadoop-0.20-tasktracker
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 188
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2)
Provides: hadoop-tasktracker
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.1+169.127-1~lucid-cdh2_all.deb
Size: 136332
SHA256: 2582a6f96133d78a3dafceb59b509d0083ceef6153666eeaae16a72015cc8bcf
SHA1: b351ed6eeaf7d55b02c69b4e092bd9fb9bdde9e2
MD5sum: f9722b1cd973c84abaad79241cc9b672
Description: Task Tracker for Hadoop
 The Task Tracker is the Hadoop service that accepts MapReduce tasks and
 computes results. Each node in a Hadoop cluster that should be doing
 computation should run a Task Tracker.

Package: hadoop-hive
Version: 0.4.1+14.5-2~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 12324
Depends: sun-java6-jre | sun-java6-sdk, hadoop
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.4.1+14.5-2~lucid-cdh2_all.deb
Size: 10741050
SHA256: 6fb3e1ab1aeefdcbb8f775eb11139caae37ef6c89d5e9cb8129c30a76d781dbc
SHA1: 1689898215d99160bf6cbeef67d350d02c14c1af
MD5sum: 23712dc5108d8e73c38b379f26eb286e
Description: A data warehouse infrastructure built on top of Hadoop
 Hive is a data warehouse infrastructure built on top of Hadoop that provides
 tools to enable easy data summarization, ad hoc querying, and analysis of
 large datasets stored in Hadoop files. It provides a mechanism to put
 structure on this data, and it also provides a simple query language called
 Hive QL, which is based on SQL and enables users familiar with SQL to query
 this data. At the same time, this language allows traditional map/reduce
 programmers to plug in their custom mappers and reducers to do more
 sophisticated analysis which may not be supported by the built-in
 capabilities of the language.

Package: hadoop-pig
Version: 0.5.0+11.19-1~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 79764
Depends: sun-java6-jre | sun-java6-sdk, hadoop
Homepage: http://hadoop.apache.org/pig/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+11.19-1~lucid-cdh2_all.deb
Size: 30163068
SHA256: 8e34e621fe0f3835ea7033173171475cae4173333248f3dcaaf1468a7b0445e0
SHA1: a43f37ddbda428b5bb91c74ca14f16602fd9ce06
MD5sum: a832c65cb91b888ffd5e0e26e7a004f7
Description: A platform for analyzing large data sets using Hadoop
 Pig is a platform for analyzing large data sets that consists of a
 high-level language for expressing data analysis programs, coupled with
 infrastructure for evaluating these programs. The salient property of Pig
 programs is that their structure is amenable to substantial parallelization,
 which in turn enables them to handle very large data sets.
 .
 At the present time, Pig's infrastructure layer consists of a compiler that
 produces sequences of Map-Reduce programs, for which large-scale parallel
 implementations already exist (e.g., the Hadoop subproject). Pig's language
 layer currently consists of a textual language called Pig Latin, which has
 the following key properties:
 .
 * Ease of programming
   It is trivial to achieve parallel execution of simple, "embarrassingly
   parallel" data analysis tasks. Complex tasks composed of multiple
   interrelated data transformations are explicitly encoded as data flow
   sequences, making them easy to write, understand, and maintain.
 * Optimization opportunities
   The way in which tasks are encoded permits the system to optimize their
   execution automatically, allowing the user to focus on semantics rather
   than efficiency.
 * Extensibility
   Users can create their own functions to do special-purpose processing.

Package: libhdfs0
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 224
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), libc6 (>= 2.4)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: misc
Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.1+169.127-1~lucid-cdh2_i386.deb
Size: 149956
SHA256: 0c34721b8ae9f7d7422f9e72c9242231eee8eaa7a234faa07c3ae4b031499ba8
SHA1: d2671fe17f6eeee3dddc56ee15ac998632b6fc60
MD5sum: ecc5e4695416d891351f6717403253eb
Description: JNI Bindings to access Hadoop HDFS from C
 See http://wiki.apache.org/hadoop/LibHDFS

Package: libhdfs0-dev
Source: hadoop-0.20
Version: 0.20.1+169.127-1~lucid-cdh2
Architecture: i386
Maintainer: Todd Lipcon
Installed-Size: 212
Depends: hadoop-0.20 (= 0.20.1+169.127-1~lucid-cdh2), libhdfs0 (= 0.20.1+169.127-1~lucid-cdh2)
Homepage: http://hadoop.apache.org/core/
Priority: extra
Section: libdevel
Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.1+169.127-1~lucid-cdh2_i386.deb
Size: 145704
SHA256: 2d72c314bbc71ea5c46014925e328731159ca944c3419abd2906f34202bd10ce
SHA1: d8e9581c8a04d9e26cc4813e7c3680fae760b991
MD5sum: 8dfef8b61b78ba3897e1cf78241b1c5f
Description: Development support for libhdfs0
 Includes examples and header files for accessing HDFS from C

Package: python-hive
Source: hadoop-hive
Version: 0.4.1+14.5-2~lucid-cdh2
Architecture: all
Maintainer: Todd Lipcon
Installed-Size: 636
Depends: python (>= 2.4), python-support (>= 0.90.0)
Provides: python2.6-hive
Homepage: http://hadoop.apache.org/hive/
Priority: extra
Section: python
Filename: pool/contrib/h/hadoop-hive/python-hive_0.4.1+14.5-2~lucid-cdh2_all.deb
Size: 49040
SHA256: 0b8ae5ba911f72869615b8e60364239249c97a1565ac2b3cbf357d04d09f34b8
SHA1: d40071df47c766bd9d8b92da7d056336333bb687
MD5sum: 104fed7cab0139788a71e3587ee65869
Description: Python client library to talk to the Hive Metastore
 This is a generated Thrift client to talk to the Hive Metastore.
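
Every stanza in this index carries `Size` and `SHA256` fields for its `.deb` file. As a rough sketch of how those two fields could be checked against a downloaded file, the helper below compares the byte count and then the digest. The function name `verify_deb` and the local file path in the usage comment are hypothetical; only Python's standard `hashlib` and `os` modules are used.

```python
import hashlib
import os


def verify_deb(path, expected_size, expected_sha256):
    """Return True if the file at `path` matches the Size and SHA256 fields."""
    # Cheap check first: the Size field is the file length in bytes.
    if os.path.getsize(path) != expected_size:
        return False

    # Hash the file in chunks so large packages are not read into memory at once.
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256.lower()


# Example call using the python-hive stanza's values (the local path is assumed):
# verify_deb(
#     "python-hive_0.4.1+14.5-2~lucid-cdh2_all.deb",
#     49040,
#     "0b8ae5ba911f72869615b8e60364239249c97a1565ac2b3cbf357d04d09f34b8",
# )
```

The stanzas also list `SHA1` and `MD5sum`; the sketch checks only `SHA256`, the strongest of the three digests provided.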