Package: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 44332 Depends: adduser, sun-java6-jre, sun-java6-bin Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 22276764 SHA256: a0b85dbbc0abc9a143e7858e2e7243ded0e219d15e19b4a2587cd306a8762868 SHA1: 51206ff99ec2643c8c3c096a246ab0c4d75da09e MD5sum: 5fed695be5b4e7ebf30576320611bba2 Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 284 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 163210 SHA256: d0ef10cf36cc9678a4f29ea47a7deff8e4172e31b414a5f74410206ea80d861b SHA1: 8270e83da3c67e3eb178a01d7f902be091ad62c0 MD5sum: 18590a53f395fa6234c3c25c48a3a60a Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155892 SHA256: 2c60532b5df06193059a3103a3426b3ca474ec7191eb9f80994950733ceb7efd SHA1: 9e73fa9ac15f8733af2b87d46facca642e1fab3a MD5sum: af9635f9d601e62d502a36ac07888b58 Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 48456 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 5831858 SHA256: 0e5ace2af33afc2856dfa7e1dc5d97d6fe6eb62374a5ad4e1abe373daab3f80c SHA1: cea0c73c23cc796671fb1ee1569362ef9b489b3a MD5sum: fdb713f257324f1d87c727b00af67b57 Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155930 SHA256: d39249c90b8b96a6dd1c87995bb11215aed357d5d09692cce5e4faf1a81d5b24 SHA1: 530d469e18790f540414ab8c48d5227a94d20c59 MD5sum: d100da3501d20ca149ba5ec39c71d884 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155886 SHA256: 5e8b2b9b9ecf9e6c45c87144017c8d3e9117a948ea77b041269661262ee25c52 SHA1: e6ee83cf8418d45a3b1405b57becc01fce77f62d MD5sum: 5b5b8edacc29da91952d94cf9a5bae54 Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 268 Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+228-1~karmic-cdh3b1_i386.deb Size: 164136 SHA256: 613ed7b33bf2023c3d88395d1a5de4a7a9d9dbf1ec3f1eea23430cd3c59ec852 SHA1: 9b1a47d6e99b66401bfb15daf7698c9b835cf2f6 MD5sum: c02e5f1ff325c7e96bc6998cb9567a0e Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 400 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+228-1~karmic-cdh3b1_i386.deb Size: 213152 SHA256: 5e1ee457c32e4051f85391f255ad6d86e53c0ad2756ef20e371ca4f75c5b36c5 SHA1: 802fa21a1c76aa53ea480e0b8842ac8e26754b99 MD5sum: 4f2cd2e53c3d04227ab6a7d8cc195890 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155932 SHA256: a629c8c7e5825786a1351d01627c4b9a5ea08534a6f9b7fadaf56e54f0045e11 SHA1: bf7aa560af70e356d2b1457c6dcb62cfd39b2700 MD5sum: 76b360a28670b638a3396aeeca808822 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 52848 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 16902912 SHA256: f2d5296fdc692f3c1ef4491f87bc508d379ddd968aa92b449f80775d1a613133 SHA1: 9e35430adbf5df6b4e977c0d6cbed1d036569d3b MD5sum: 8d36d95fc7ab31004e9042a982c8d927 Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 204 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+228-1~karmic-cdh3b1_all.deb Size: 155904 SHA256: ac688124744b57ef5226a02037bb7d7442181ed61936193a0a1367fb15312f4f SHA1: 2b71daa521b8e19e4d3989043122594e85d9fa40 MD5sum: c85665bc07edf102696f009752410fa2 Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hive Version: 0.5.0+20-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 13132 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.5.0+20-1~karmic-cdh3b1_all.deb Size: 11109004 SHA256: 4a12a2429194d45a6eb721cb77ab2f55571142ba67ddf50de004731f9f7d0c7f SHA1: 46da18896af24c3283d7a78a1268d292aae6866a MD5sum: b0c76c64e4cb2b489ebe92d20e36f4ea Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.5.0+30-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 79756 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+30-1~karmic-cdh3b1_all.deb Size: 30156006 SHA256: af5d4a10a0ca93d232ddd99a76da566d713d693abd7fe50f747b1d98b7cf7633 SHA1: 803e96102754d30b890147f34902caf28956449b MD5sum: a28ba2f603b514451270a84661e19252 Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 240 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1), libc6 (>= 2.4) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+228-1~karmic-cdh3b1_i386.deb Size: 169744 SHA256: bac55fbf7e7253bd9c3a5a5768d0d81f9cbca9a01efe2358fb82d0e839c1a6b0 SHA1: e01abbb9343880679023b79421a449ff9f7ecf2c MD5sum: d71f05931922709450a6c940f0f10bab Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.2+228-1~karmic-cdh3b1 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 228 Depends: hadoop-0.20 (= 0.20.2+228-1~karmic-cdh3b1), libhdfs0 (= 0.20.2+228-1~karmic-cdh3b1) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+228-1~karmic-cdh3b1_i386.deb Size: 164818 SHA256: 9c0dcec97fe6fb2045c2111005261e84feee98aff9d30fcfb7b00754f9c87367 SHA1: 2ec7d7586798c60954823a4bfcd6aa7ac230c4a2 MD5sum: 176fc65054473790b6799d02b45d688a Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: python-hive Source: hadoop-hive Version: 0.5.0+20-1~karmic-cdh3b1 Architecture: all Maintainer: Todd Lipcon Installed-Size: 700 Depends: python (>= 2.4), python-support (>= 0.90.0) Provides: python2.5-hive, python2.6-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.5.0+20-1~karmic-cdh3b1_all.deb Size: 54842 SHA256: 08647ea0567cf2560a813b70ccf9088994af895544f3569820932d466df5ed91 SHA1: e9547a715e62554039a84dae37803de502fb747d MD5sum: e99d01cbf3195ad2d3140f793988782b Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore.