Package: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 44332 Depends: adduser, sun-java6-jre, sun-java6-bin Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 22276284 SHA256: cff9a815416aea332f1a7b51bfccb1a231276e0c99cb32af32be30488eb6d68c SHA1: bed2963b49a550eab46a554eb47b48d2ee24dde5 MD5sum: b41dd385f87c7ad81d0ff081838831a0 Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 284 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 163358 SHA256: 97c128b15c90ee4e5a6ec778bb982c39d585ef5fa4a9d45d64d4a364abbe15b4 SHA1: 9830c0377e1ef6947aadc2cae23837f7d45256fa MD5sum: 655a6e4f58ce036362da707e8b2a4e23 Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 155782 SHA256: 9cf2296f2f15dac7efdc340d372808fe5f45f96b246783976c237366ee61c32a SHA1: c4f950f5eeaa28f525a5112be51b187c33f54bf3 MD5sum: fd91a5bcac99eb17a4a8e98aa3baa789 Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 48456 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 5835452 SHA256: d80d67bc67b8ad181bd43c20b874128b758f608a97686bafe94f80f88bee74ae SHA1: e0dbb3f40ed4313c479f0fc7efa18b4e8bc39189 MD5sum: 9d5e4016b2b1adc6ed8fe40db53953b6 Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 155820 SHA256: a4905a0ee51bc74ab96acde081d2b038d8130e9332a42136abd622b54cd679f5 SHA1: 39aebfc5369fbce6c501f02e0a6b780ef1fe52d1 MD5sum: cdf855e621c5dc2d2d630ad1ebd20df4 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 155782 SHA256: 8d916c175761065ebf3f49231d516dd6c634e77e7836292d0dcb7130f4e72d0d SHA1: d8b575eacc7d09454e04495b4569577f27c24a06 MD5sum: 80db255aab2be66d97959b57c7d756b6 Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 264 Depends: libc6 (>= 2.7-1), hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~lenny-cdh3b1_i386.deb Size: 164228 SHA256: 293e45a15f946d4db5225026b3d5691151390938d678346e4a5a17da5fd8470f SHA1: ae6cd3d2b5c24fa07d13f7220cd1f49a9631c052 MD5sum: bede1e99a65974492f3bebe99a2881c9 Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 400 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~lenny-cdh3b1_i386.deb Size: 212878 SHA256: bbbdb8a8cb4b766bfed05be8bab4d517586c315118f9873832c09165211da6e9 SHA1: 7f4568e326251ce3feb68a5c31aa9d55e55ba112 MD5sum: 99571ce4ca3b6c84bbc6d8597d6a9fd2 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 155814 SHA256: bce400b5f70b0fb3425e29adecab1cb1b24dc02cf69995b01e8c1a605f7ffc37 SHA1: fbb69bcb0675508bc7be354a1c0f06f51ed59519 MD5sum: 527cf35f7112b501f011303267975077 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 52968 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 16890578 SHA256: 4f7627ea80fecdc13add43584a1274806d6c16c160bee64eec281c033f8d1ed2 SHA1: 72b20e6f0b42fe8ba89cdb8aae66432eb20e8a82 MD5sum: 0bdd048360321ff95b53a1965e0dd363 Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~lenny-cdh3b1_all.deb Size: 155794 SHA256: e43ce52e54591e3b87603173a65ac5d33a552238198dbbf1e6d3542beafce425 SHA1: 2c283f83104fce1d3bbb4b13bd7a8a835e505247 MD5sum: 30bf9dc2c66cd4b7892a04df02bb0442 Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hive Version: 0.5.0+20-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 13132 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~lenny-cdh3b1_all.deb Size: 11111488 SHA256: aff4e99305dffb69e60da2403a41abdee36289bcedf5b4ee198dcc9a98089aa5 SHA1: 70629f24503f576dcc5f02c4c5b226efca27fe9e MD5sum: 7f0cbe58fe5e56f87d7d7ef4cdd5f144 Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.5.0+30-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 79756 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~lenny-cdh3b1_all.deb Size: 30154898 SHA256: 57a228a5c6fc40766bdf7d1e9dce5dbe64ee976350b7f57999440ce722667ad2 SHA1: f523014e47d9c2169c4c2a9a7424916a1f19cb77 MD5sum: c586a4a224a2bc9fedbbd3aae53ee737 Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 236 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1), libc6 (>= 2.7-1) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~lenny-cdh3b1_i386.deb Size: 169982 SHA256: 148d5e3c0bea7caf04128316ae62042286494b6906b8e7253f8dac63fc03a012 SHA1: 528cf7c69e6fca74fe6f3a28df232866080c17c8 MD5sum: 2eb011ce73fbdaa6bb3b31e439db9ed9 Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.2+228-1~lenny-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 228 Depends: hadoop-0.20 (= 0.20.2+228-1~lenny-cdh3b1), libhdfs0 (= 0.20.2+228-1~lenny-cdh3b1) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~lenny-cdh3b1_i386.deb Size: 165706 SHA256: 3912287f2b28d79d09dd7dd9ebc344431f1a614b02d9ae1467e26846558d51d7 SHA1: 991bb951265b9a474f5ababe0285627102e5b046 MD5sum: 8dd85e919e8d5ab7d879b7545372720c Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: python-hive Source: hadoop-hive Version: 0.5.0+20-1~lenny-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 700 Depends: python (>= 2.4), python-support (>= 0.7.1) Provides: python2.4-hive, python2.5-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~lenny-cdh3b1_all.deb Size: 55294 SHA256: c3802bb97e01b9db780d32bb5cca0f65fcd8e786eaced3f9c74e76b98ea5a5b2 SHA1: 30839b70f513bc9e5b23993fc409055bfc8e8613 MD5sum: 6ad5333d0c37ef0d46b1b13068d0a635 Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore.