Package: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 21884 Depends: adduser, sun-java6-jre Recommends: hadoop-0.18-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 12495782 SHA256: 2784782898650b05a1d00ffa90cd0b90a739179e6d38e05557d040dec9426c50 SHA1: 26203641ac85d836ae1c56a4b4f4eb54291632ed MD5sum: 3d595cc01df320df2a7e4ddacfecdc17 Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.18-conf-pseudo Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 232 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-conf-pseudo_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 97770 SHA256: 6a999037abce55ee4a7368f5bb7de50e158c71a7dae99dbb7153514fc3533971 SHA1: 90a3f8e481130d1d8ce533646efdcc85ebbf3b0d MD5sum: 0e94e6a7b75dfaab358a57e72ed9240d Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.18-datanode Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 136 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-datanode_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 84604 SHA256: ac862adc47c98ec919bfe4a6ec3891b1148791fa1066fa2e6b9c16b5185d8675 SHA1: 6526a4ae42818c64a7ef0a210b065d05fdf92360 MD5sum: 075725d64266690b36d5656dfe41f61b Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.18-doc Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 37548 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-doc_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 5925030 SHA256: c40adf304ee05a89f40b8c414b85d3318a37565ae3ab4ea122284f19b4211616 SHA1: bd08bb04bad066192f045b52f7fdef0c8c26c2b1 MD5sum: 192716756c83d8c784fe7892ac029dcf Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.18-jobtracker Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 136 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-jobtracker_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 84642 SHA256: a4fb928042f031d826ad0a34117272a21819a73189c53f3b264229f7752385c2 SHA1: 92107807cf203f9a8059d50c73222b4ec06af7c5 MD5sum: 2b07aa1543133af9adcbedf0d7d40e89 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.18-namenode Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 136 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-namenode_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 84608 SHA256: 532e64d1cea65a51cbec813c02db81ce128975a9fe0fcc3481eafbb5eaf95f05 SHA1: a275d61f818269f6b16cc0e1f06a5731279096de MD5sum: d1e909c330ceea94349f6300bab30880 Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.18-native Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 236 Depends: libc6 (>= 2.4), hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2), liblzo2-2, libz1 Enhances: hadoop-0.18 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-native_0.18.3+76.2-1~karmic-cdh2_amd64.deb Size: 100638 SHA256: 30265e3daa5e78ca3a09e60cc7eda44042650afc48aff0fc660d096ea00f07b3 SHA1: 5d24b85da4d3ad4cc078b2d63d3565d8ebf56522 MD5sum: 5da1e34b3c048b4fd0f8e6480316a9b0 Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.18-pipes Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 396 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-pipes_0.18.3+76.2-1~karmic-cdh2_amd64.deb Size: 143212 SHA256: afc8d0569c8c10c4fc4daa1559e68ff391d2259bb70f9663f6b6e3a0a4b303a0 SHA1: 4b4425b67a12a9bec8c1bb555041d9abcb98bfb5 MD5sum: 130910a607f72fe4dab81814a805b2c4 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.18-secondarynamenode Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 136 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-secondarynamenode_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 84638 SHA256: 63426f65aca0ed5cb4ef8d27cec0953419a4ead05773ff6ce5dca025530acae7 SHA1: 3e8da40341fc548c454d921a300ea0161120603e MD5sum: fede2e85070c6535c7c6e6db3acff388 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.18-source Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 45588 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-source_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 16305862 SHA256: 4aea47b2f1e95fb5870b5ddc01f80526e20e50acac2ca7a726749b2c09f691fa SHA1: bd38c42b03e01efedc889903f133c87c58f14356 MD5sum: 5c9476d9451f2cbd9abe525bc363f3a6 Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.18-tasktracker Source: hadoop-0.18 Version: 0.18.3+76.2-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 136 Depends: hadoop-0.18 (= 0.18.3+76.2-1~karmic-cdh2) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.18/hadoop-0.18-tasktracker_0.18.3+76.2-1~karmic-cdh2_all.deb Size: 84618 SHA256: bf279a1b8f01da563589645ed24e08aa8e7c592f020ab54b62d8d1d90bcab910 SHA1: 64100b0b3abddc4f3e9f571e6cfe524b90b25ff0 MD5sum: d222cc125c665dfc701286655c6e781b Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 42520 Depends: adduser, sun-java6-jre, sun-java6-bin Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 22094302 SHA256: 7ef8843361724cce78af4f7a727c40cd3fa5ab305247c0fb7c7962a3cd51b285 SHA1: 65000b70cfffe376135807873fff0e7584cd9205 MD5sum: a613f540aecb72c03d0bcc7ccb3bb2b1 Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 268 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 143264 SHA256: c6aeeb11119ac0377c7a0b33166d30ed1200848d6afbf10f17bd46a399d61df5 SHA1: 6a1e384d2e437dac8e2e278377598f542982475a MD5sum: e430cd245b92fbf28c0b16d8ef96ebb5 Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 188 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 136314 SHA256: 986a8e74a2a9c3924647dfd3287c2aa7572d37933b8b9457263b37aa13a970fc SHA1: 3c7c49bfd1b4b5bba6bb05b8f9e85be27d7e01b7 MD5sum: 0628c4b5a7c732acd55ee8e71c655cea Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 47288 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 5719364 SHA256: 688f9c2dcc7435fed51500b95ad25bb71e02f5eaf7f17775baf6440d5b5326b2 SHA1: e936d96c1600c3be43d0159759a15d98282feed5 MD5sum: 5b906bda7f3f7ae7814e24b702f4a740 Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-fuse Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 296 Depends: libc6 (>= 2.7), libfuse2 (>= 2.6), libhdfs0, hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2), fuse-utils Enhances: hadoop-0.20 Provides: hadoop-fuse Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-fuse_0.20.1+169.127-1~karmic-cdh2_amd64.deb Size: 168968 SHA256: 34203b0cf8b414a2622c51c59315b94ce6fa6d09c7ed05422c79a27ac53e3f55 SHA1: ed861bf5a60ce383f977518060dffce4185ba3c9 MD5sum: a56312e1d2a9e412d715d01b0289bcb0 Description: HDFS exposed over a Filesystem in Userspace These projects (enumerated below) allow HDFS to be mounted (on most flavors of Unix) as a standard file system using the mount command. Once mounted, the user can operate on an instance of hdfs using standard Unix utilities such as 'ls', 'cd', 'cp', 'mkdir', 'find', 'grep', or use standard Posix libraries like open, write, read, close from C, C++, Python, Ruby, Perl, Java, bash, etc. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 188 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 136348 SHA256: 412b82b2b47e485125ff82f8405ca214f1616aaa07f8ec2d3d9f86899770f3a1 SHA1: 8bed9a3798bd01067af52fedec519d6d1dede6df MD5sum: 4152fc257e504c5fea0bd5c8891a9247 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 188 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 136312 SHA256: 7cad375475a33204b0d36565202b96c42c20233a6e676b346ddb7a2f5e3cd70a SHA1: 9929e3483f755bf486bd4798d526604d8bdf33bb MD5sum: 23dd94d460c3c21644fb64360e3701da Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 256 Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.1+169.127-1~karmic-cdh2_amd64.deb Size: 145272 SHA256: a4c291287f23a14f99126de709d78823ba5eee4df1507aee9b52433ea1c21d6a SHA1: eac74724dc1d922c7894a09da9b566f16c783867 MD5sum: 8c7c6e21f92abffe653c157c59ab745e Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 456 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.1+169.127-1~karmic-cdh2_amd64.deb Size: 196382 SHA256: 90d371a59bf7a6bfdf6848934a407b055f14c9cbd59830921ec40f4782fd9c6b SHA1: 80707497003815d25280403c9146a30acf02e44a MD5sum: 7f3f6e1004de1ef21521b0f9f9b576b8 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 188 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 136346 SHA256: 60f9aae5b31c93a2647ced42c5ae41bf403567af0103ee37e430504f2c377938 SHA1: e8990707127b6c428f88a2639d03adf17b8c35a1 MD5sum: eb36b6e78efeb39e88f5632b491bf4f0 Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 52256 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 16705058 SHA256: 8a76e9732e2cdc5090a02d2b53c09d730f88f70e24e3f4cf2f37374b109bdbd2 SHA1: 4625850eac55f9d540ffcc8059401ef761cadfcd MD5sum: 87177f29d9ecf95ba7c5cdaa666f0e2a Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 188 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.1+169.127-1~karmic-cdh2_all.deb Size: 136324 SHA256: fb2a98b31b84e3b49ac7c2343e2356a976c613cf4b2ab6d38bc86db3b38c9d20 SHA1: 910d83bba907194fcb37a4495d236244e3b8e54a MD5sum: 7d0ff0a1d7d0a1d4330c8c476577b8c6 Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hive Version: 0.4.1+14.5-2~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 12324 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.4.1+14.5-2~karmic-cdh2_all.deb Size: 10739706 SHA256: 4b90a89401d338a5f24b857a773b80bf26879b2989fa121f8dbdf7fc597ea054 SHA1: 905c6dd5c560666c18be7e018d72324ec96a1718 MD5sum: 86a3cd308b901fcf96a9aed6571bbc4e Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.5.0+11.19-1~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 79764 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.5.0+11.19-1~karmic-cdh2_all.deb Size: 30154596 SHA256: 271ff619a4a7cea7a3f414981a2d7879af18070e368a1c44d5b2db9f530bee13 SHA1: 6b56b13c80a5892f8358d22c6545c3dd81ce0852 MD5sum: 39d53d176f22af96e93e17f68507ec14 Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 224 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2), libc6 (>= 2.4) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.1+169.127-1~karmic-cdh2_amd64.deb Size: 151786 SHA256: 7aad064503bf10e05f3a6f8fa576ef8e095c2cc102114df38a26274b490bb1b1 SHA1: e689ad14c2f8b0e7a93e2080c6982aca5e5561e9 MD5sum: 864c9f12d4e75b3d0492867e761bf03d Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.1+169.127-1~karmic-cdh2 Architecture: amd64 Maintainer: Todd Lipcon Installed-Size: 212 Depends: hadoop-0.20 (= 0.20.1+169.127-1~karmic-cdh2), libhdfs0 (= 0.20.1+169.127-1~karmic-cdh2) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.1+169.127-1~karmic-cdh2_amd64.deb Size: 145116 SHA256: cefa86a61f5081f3d3d9620dffb2b929d3756095b42136304995d11e696eae6f SHA1: 2b5451fd3c75cf1cb0c71eb17f69c8662217e41c MD5sum: 5cf398398085bfe458bfbe5017c7cca0 Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: python-hive Source: hadoop-hive Version: 0.4.1+14.5-2~karmic-cdh2 Architecture: all Maintainer: Todd Lipcon Installed-Size: 636 Depends: python (>= 2.4), python-support (>= 0.90.0) Provides: python2.5-hive, python2.6-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.4.1+14.5-2~karmic-cdh2_all.deb Size: 49220 SHA256: 46a74ab971476e4187872286fb8f90014351ab138a3ee3625eec830e8862859c SHA1: 75fdc9acd27898c9a7cd422e5179f9c875172add MD5sum: 6e685aeac7612ada311221aedbf9258d Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore.