Package: flume Version: 0.9.3+15.3-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 13484 Depends: sun-java6-jre | sun-java6-sdk, hadoop-zookeeper, adduser, hadoop-0.20 (>= 0.20.2+720) Homepage: http://www.cloudera.com Priority: extra Section: misc Filename: pool/contrib/f/flume/flume_0.9.3+15.3-1~maverick-cdh3_all.deb Size: 11422336 SHA256: 04cdb48861910df781feec0bcd8729d7fe9a5351de278b090b5c6bd87075937c SHA1: b5c7b185ee522cd07dedb612dae6a5c0815f28a2 MD5sum: c17153198b2ce9bb602ba73b7dc3ac28 Description: reliable, scalable, and manageable distributed data collection application Flume is a reliable, scalable, and manageable distributed data collection application for collecting data such as logs and delivering it to data stores such as Hadoop's HDFS. It can efficiently collect, aggregate, and move large amounts of log data. It has a simple, but flexible, architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications. Package: flume-master Source: flume Version: 0.9.3+15.3-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 60 Depends: flume (= 0.9.3+15.3-1~maverick-cdh3) Homepage: http://www.cloudera.com Priority: extra Section: misc Filename: pool/contrib/f/flume/flume-master_0.9.3+15.3-1~maverick-cdh3_all.deb Size: 4510 SHA256: ec4dae6e8709dcf9693d45af4b33b73c48484a92a5d4402a2ebd24da526d76f0 SHA1: 49ea385a84716e3db2e01c138388529994bb08c6 MD5sum: 8fc43d144c3184c5761923f4eefee24b Description: central administration point for the flume data collection system The Flume master daemon is the central administration and data path control point for flume nodes. Package: flume-node Source: flume Version: 0.9.3+15.3-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 60 Depends: flume (= 0.9.3+15.3-1~maverick-cdh3) Homepage: http://www.cloudera.com Priority: extra Section: misc Filename: pool/contrib/f/flume/flume-node_0.9.3+15.3-1~maverick-cdh3_all.deb Size: 4522 SHA256: c52c5d0d7a059e0bf6e0d9204a295e6df8081828fcc03145703c9bfbec7240f9 SHA1: d76df9f68c160b26d893149a547b7cbffb85b1fe MD5sum: cc585963c647fbb5e2f6d2eb449ff0ef Description: core element of Flume's data path that collects and delivers data The Flume node daemon is a core element of flume's data path and is responsible for generating, processing, and delivering data. Package: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 88952 Depends: adduser, sun-java6-jre, sun-java6-bin Recommends: hadoop-0.20-native Provides: hadoop Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 40856230 SHA256: 69ce65cf51abc1f0931456454090d5c9f05018f5f5e64ae26c726718f44643a8 SHA1: a2f472746c9df4395a1b50c1586cce7604357b8d MD5sum: c3f13bfeea030502c7357874dac9315a Description: A software platform for processing vast amounts of data Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Package: hadoop-0.20-conf-pseudo Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 360 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3), hadoop-0.20-namenode (= 0.20.2+923.21-1~maverick-cdh3), hadoop-0.20-datanode (= 0.20.2+923.21-1~maverick-cdh3), hadoop-0.20-secondarynamenode (= 0.20.2+923.21-1~maverick-cdh3), hadoop-0.20-jobtracker (= 0.20.2+923.21-1~maverick-cdh3), hadoop-0.20-tasktracker (= 0.20.2+923.21-1~maverick-cdh3) Provides: hadoop-conf-pseudo Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-conf-pseudo_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 235904 SHA256: bb34df2a6fb45f8d3a3a2ff177f9a830ca744f097305a284d6cde895ba90d14c SHA1: 12b10a836d8259dba198024f6bfb28f6c674b43d MD5sum: 22660f5f8b7b95244445ff569599d2f7 Description: Pseudo-distributed Hadoop configuration Contains configuration files for a "pseudo-distributed" Hadoop deployment. In this mode, each of the hadoop components runs as a separate Java process, but all on the same machine. Package: hadoop-0.20-datanode Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 288 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Breaks: hadoop-0.20 (<< 0.20.2+737-1) Replaces: hadoop-0.20 (<< 0.20.2+737-1) Provides: hadoop-datanode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-datanode_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 229322 SHA256: f9af5eb54eb7452e156aca5fd3d53b067f8c16982344fbfd016c2805f1cad42e SHA1: c22cc43ea25068d9c42dee3193e38d2163521ede MD5sum: 094731118a9146026ccf17d849a3c1fb Description: Data Node for Hadoop The Data Nodes in the Hadoop Cluster are responsible for serving up blocks of data over the network to Hadoop Distributed Filesystem (HDFS) clients. Package: hadoop-0.20-doc Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 56884 Provides: hadoop-doc Homepage: http://hadoop.apache.org/core/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-doc_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 6531098 SHA256: da91cbfb66cb2c1f0d6ed0124a1dfd895779d42858c5c799a6add73f74fbca70 SHA1: 251e75eebbc84eee167d5c38f2744126627c18da MD5sum: 4aecb246f8e5e73a855386727ffa1999 Description: Documentation for Hadoop This package contains the Java Documentation for Hadoop and its relevant APIs. Package: hadoop-0.20-fuse Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 384 Depends: libc6 (>= 2.7), libfuse2 (>= 2.8.1), libhdfs0, hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3), fuse-utils Enhances: hadoop-0.20 Provides: hadoop-fuse Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-fuse_0.20.2+923.21-1~maverick-cdh3_i386.deb Size: 258708 SHA256: 317097e93793687cebb54ce10a53f70a87dfc7c837ebc6ad5693e34bf9296589 SHA1: f8495a2d6cef0b70b07b9b603b4fffa813d9d87e MD5sum: 5b2a49dee9e7681425a964e6aff87abd Description: HDFS exposed over a Filesystem in Userspace These projects (enumerated below) allow HDFS to be mounted (on most flavors of Unix) as a standard file system using the mount command. Once mounted, the user can operate on an instance of hdfs using standard Unix utilities such as 'ls', 'cd', 'cp', 'mkdir', 'find', 'grep', or use standard Posix libraries like open, write, read, close from C, C++, Python, Ruby, Perl, Java, bash, etc. Package: hadoop-0.20-jobtracker Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 288 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Breaks: hadoop-0.20 (<< 0.20.2+737-1) Replaces: hadoop-0.20 (<< 0.20.2+737-1) Provides: hadoop-jobtracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-jobtracker_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 229350 SHA256: 339cc71b3b438be86443fa09bb1c9280e17c08f566c9b5a22945d9a3e5d8e8d1 SHA1: 6d545ce1653e57e2a3b1e3df238619814f08fa4f MD5sum: 711fa01abf3d235155773330a3795567 Description: Job Tracker for Hadoop The jobtracker is a central service which is responsible for managing the tasktracker services running on all nodes in a Hadoop Cluster. The jobtracker allocates work to the tasktracker nearest to the data with an available work slot. Package: hadoop-0.20-namenode Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 288 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Breaks: hadoop-0.20 (<< 0.20.2+737-1) Replaces: hadoop-0.20 (<< 0.20.2+737-1) Provides: hadoop-namenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-namenode_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 229316 SHA256: 4e3f55b4ac0bbd5dbb03af8570f956ef514ab0ea675699e4a27b29b26849a6d1 SHA1: baf96f72670b7d166e5fb92e80921b43f68d5df0 MD5sum: be007b5430677194408b5c71f4726925 Description: Name Node for Hadoop The Hadoop Distributed Filesystem (HDFS) requires one unique server, the namenode, which manages the block locations of files on the filesystem. Package: hadoop-0.20-native Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 356 Depends: libc6 (>= 2.7), hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3), liblzo2-2, libz1 Enhances: hadoop-0.20 Provides: hadoop-native Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-native_0.20.2+923.21-1~maverick-cdh3_i386.deb Size: 243624 SHA256: 5b1e00e03d88776d2073c35bce049520ac734891d5d219d4dd69e0929cc0550a SHA1: 472c50b0894317ff46495b4fe3a92fde9fecb9c9 MD5sum: b3046bccf162105c058919589a7ef804 Description: Native libraries for Hadoop (e.g., compression) This optional package contains native libraries that increase the performance of Hadoop's compression. Package: hadoop-0.20-pipes Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 468 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Provides: hadoop-pipes Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-pipes_0.20.2+923.21-1~maverick-cdh3_i386.deb Size: 286822 SHA256: 4946f36a86bcca0fb591bd669243e18768b226147ac4aeb2105cc349ddcdd54d SHA1: bf651a1ab9ab013ca40e5dfadbbbc7882c9e3e51 MD5sum: 251e7fd6791e529573841b5cbcf66b96 Description: Interface to author Hadoop MapReduce jobs in C++ Contains Hadoop Pipes, a library which allows Hadoop MapReduce jobs to be written in C++. Package: hadoop-0.20-sbin Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 332 Depends: libc6 (>= 2.4), hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Enhances: hadoop-0.20 Provides: hadoop-sbin Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-sbin_0.20.2+923.21-1~maverick-cdh3_i386.deb Size: 251172 SHA256: 9bbd7ff3a9dfc8df5817884d98aa6f25fed05a230ccdcbd7d2fd6742bf5ad423 SHA1: 5771c9d1fcb7367b814fc81458debea494f9d138 MD5sum: 77fa89d6202060fa580c589e3d351fe8 Description: Server-side binaries necessary for secured Hadoop clusters This package contains a setuid program, 'task-controller', which is used for launching MapReduce tasks in a secured MapReduce cluster. This program allows the tasks to run as the Unix user who submitted the job, rather than the Unix user running the MapReduce daemons. . This package also contains 'jsvc', a daemon wrapper necessary to allow DataNodes to bind to a low (privileged) port and then drop root privileges before continuing operation. Package: hadoop-0.20-secondarynamenode Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 288 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Breaks: hadoop-0.20 (<< 0.20.2+737-1) Replaces: hadoop-0.20 (<< 0.20.2+737-1) Provides: hadoop-secondarynamenode Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-secondarynamenode_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 229356 SHA256: 65e06d678bbaf71d1087d7f46b64501666a644e77b2d01ca54edaae10e0b3058 SHA1: cd4fd84aaa6c1dc3ba6da9102868d5e93ce04177 MD5sum: 900161de4673bde22449bf1f8ae8fb1c Description: Secondary Name Node for Hadoop The Secondary Name Node is responsible for checkpointing file system images. It is _not_ a failover pair for the namenode, and may safely be run on the same machine. Package: hadoop-0.20-source Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 57924 Provides: hadoop-source Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-source_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 18075306 SHA256: 6760195c3005621fff4e784c4a61db477e52748635009727c6e460b6532f4601 SHA1: a7cff4d553faa579acd2b93f9552b2f6887fb354 MD5sum: 07adbee4e805767d2c1fe171a0c05f56 Description: Source code for Hadoop This package contains the source code for Hadoop and its contrib modules. Package: hadoop-0.20-tasktracker Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 288 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3) Breaks: hadoop-0.20 (<< 0.20.2+737-1) Replaces: hadoop-0.20 (<< 0.20.2+737-1) Provides: hadoop-tasktracker Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/hadoop-0.20-tasktracker_0.20.2+923.21-1~maverick-cdh3_all.deb Size: 229328 SHA256: b111dda0b14735516ae62a6dc2e47162d97f2b975625fda0e263e0292120083f SHA1: 24653b1c7c1cd397ce2673cd5de5811320566fa0 MD5sum: f9808713a280ba1913335ce0085320fc Description: Task Tracker for Hadoop The Task Tracker is the Hadoop service that accepts MapReduce tasks and computes results. Each node in a Hadoop cluster that should be doing computation should run a Task Tracker. Package: hadoop-hbase Version: 0.90.1+15.18-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 22776 Depends: adduser, sun-java6-jre | sun-java6-sdk, hadoop-zookeeper (>= 3.3.1+8), hadoop-0.20 (>= 0.20.2+700) Recommends: ntp Homepage: http://hadoop.apache.org/hbase/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hbase/hadoop-hbase_0.90.1+15.18-1~maverick-cdh3_all.deb Size: 20835558 SHA256: e83dec30c677497a1ac0ea44e0b461caf0f4493203bc6c85cf3e7e9fba061d08 SHA1: 642b45086f0725b521474f1be3b4a9bd1bbf48fb MD5sum: 32169fbfdbba4e97609fedda55bb6cf6 Description: HBase is the Hadoop database Use it when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Package: hadoop-hbase-doc Source: hadoop-hbase Version: 0.90.1+15.18-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 1012 Homepage: http://hadoop.apache.org/hbase/ Priority: extra Section: doc Filename: pool/contrib/h/hadoop-hbase/hadoop-hbase-doc_0.90.1+15.18-1~maverick-cdh3_all.deb Size: 547126 SHA256: 7601021f53f0af5e538d5d753043fb94e2a16119b23bcca4a4b938033471c8a0 SHA1: 550760ae7e9bd7d936954ff943e983931b18c19c MD5sum: 2dafcb9a4dff41c2f0560319a693b4c6 Description: Documentation for HBase This package contains the HBase manual and JavaDoc. Package: hadoop-hbase-master Source: hadoop-hbase Version: 0.90.1+15.18-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 76 Depends: hadoop-hbase (= 0.90.1+15.18-1~maverick-cdh3) Homepage: http://hadoop.apache.org/hbase/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hbase/hadoop-hbase-master_0.90.1+15.18-1~maverick-cdh3_all.deb Size: 7222 SHA256: a27fbff99aaec003e9d4abd59eb810a7f785addfe4a64da1e1b4e83bed4c80a6 SHA1: 9fc04a0e2023385e5e6c5c02f3978bab5c130687 MD5sum: a2210936a7922c8bdc17273099645ec8 Description: HMaster is the "master server" for a HBase There is only one HMaster for a single HBase deployment. Package: hadoop-hbase-regionserver Source: hadoop-hbase Version: 0.90.1+15.18-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 76 Depends: hadoop-hbase (= 0.90.1+15.18-1~maverick-cdh3) Homepage: http://hadoop.apache.org/hbase/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hbase/hadoop-hbase-regionserver_0.90.1+15.18-1~maverick-cdh3_all.deb Size: 7256 SHA256: 67fc329b1a396d7f7e84ec90282185eeba7e6076db69cc5ace4fef1d9e372f2a SHA1: 074645485e5ef0f3b9752235b2651facf242931b MD5sum: 192f0fd91b8b92c04ba1136dfcc3b925 Description: HRegionServer makes a set of HRegions available to clients It checks in with the HMaster. There are many HRegionServers in a single HBase deployment. Package: hadoop-hbase-thrift Source: hadoop-hbase Version: 0.90.1+15.18-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 76 Depends: hadoop-hbase (= 0.90.1+15.18-1~maverick-cdh3) Homepage: http://hadoop.apache.org/hbase/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hbase/hadoop-hbase-thrift_0.90.1+15.18-1~maverick-cdh3_all.deb Size: 7228 SHA256: 0251ad64c2f2b608302ef0fc6eef7c3b5eb18b81b2903810de03b5efc7f95dfb SHA1: 60c0a1161723df4ab055467073eb80a99c274e2d MD5sum: e99f6e4e2785cdb169f0f840b8f69ce8 Description: Provides an HBase Thrift service This package provides a Thrift service interface to the HBase distributed database. Package: hadoop-hive Version: 0.7.0+27.1-2~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 22036 Depends: sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-hive/hadoop-hive_0.7.0+27.1-2~maverick-cdh3_all.deb Size: 18606960 SHA256: ddc66a1f74792ee319f2064b45fff9d865812d1887e203c04acab8d2efa8ed0f SHA1: d5793f4c82d05d8bf012f40c02c1658e54a42806 MD5sum: 8ccb1a1c2f813cf8b0900c5737d9c71c Description: A data warehouse infrastructure built on top of Hadoop Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language. Package: hadoop-pig Version: 0.8.0+20.3-1~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 104112 Depends: default-jre-headless | sun-java6-jre | sun-java6-sdk, hadoop Homepage: http://hadoop.apache.org/pig/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-pig/hadoop-pig_0.8.0+20.3-1~maverick-cdh3_all.deb Size: 35874314 SHA256: 8e3c60beec452731a2616d8ce8dabc39a23d9425d2954f17349769567b063446 SHA1: 753e95cb6e651d2d06d12f53ac3ce44332f9d360 MD5sum: 7810dae828bd94b62842763cee7b4226 Description: A platform for analyzing large data sets using Hadoop Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. . At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g., the Hadoop subproject). Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: . * Ease of programming It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand, and maintain. * Optimization opportunities The way in which tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency. * Extensibility Users can create their own functions to do special-purpose processing. Package: hadoop-zookeeper Version: 3.3.3+12.1-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 6408 Depends: sun-java6-jre | sun-java6-sdk Conflicts: zookeeper Replaces: zookeeper Homepage: http://hadoop.apache.org/zookeeper/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-zookeeper/hadoop-zookeeper_3.3.3+12.1-1~maverick-cdh3_all.deb Size: 2939658 SHA256: 4905c9b1bc74c489cf5896184c3c112abe067deea5970a2ea7c80934d124cc55 SHA1: 4727c21bf52ad47290a4085b3848b782ed91d643 MD5sum: 08b988eb92e25d46308beebd3bb6af77 Description: A high-performance coordination service for distributed applications. ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications. Each time they are implemented there is a lot of work that goes into fixing the bugs and race conditions that are inevitable. Because of the difficulty of implementing these kinds of services, applications initially usually skimp on them ,which make them brittle in the presence of change and difficult to manage. Even when done correctly, different implementations of these services lead to management complexity when the applications are deployed. Package: hadoop-zookeeper-server Source: hadoop-zookeeper Version: 3.3.3+12.1-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 64 Depends: hadoop-zookeeper (= 3.3.3+12.1-1~maverick-cdh3) Homepage: http://hadoop.apache.org/zookeeper/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-zookeeper/hadoop-zookeeper-server_3.3.3+12.1-1~maverick-cdh3_all.deb Size: 5818 SHA256: 592231ac862f345335ebb74c8ae188f97ce289bd911b7f81080191a948ae6fca SHA1: d4867b103348d0ab81d46ef3d8ca58389014454a MD5sum: 14b8f88050267ebe76eb782a42d9aa8e Description: This runs the zookeeper server on startup. Package: hue Source: hue-common Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 40 Depends: hadoop-0.20 (>= 0.20.2+730), hue-plugins (= 1.2.0.0+54-1~maverick-cdh3), hue-common (= 1.2.0.0+54-1~maverick-cdh3), sun-java6-bin, hue-user, hue-about (= 1.2.0.0+54-1~maverick-cdh3), hue-help (= 1.2.0.0+54-1~maverick-cdh3), hue-filebrowser (= 1.2.0.0+54-1~maverick-cdh3), hue-jobbrowser (= 1.2.0.0+54-1~maverick-cdh3), hue-jobsub (= 1.2.0.0+54-1~maverick-cdh3), hue-beeswax (= 1.2.0.0+54-1~maverick-cdh3), hue-proxy (= 1.2.0.0+54-1~maverick-cdh3) Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-common/hue_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 5380 SHA256: f20f1df961bbead89d520d3658ba9033b2986d39ed821be0a973804ef7e7e9c1 SHA1: b75589c6117b17fe085edbe0a973da4254bbdbf1 MD5sum: 846b94553ea33e5e10a6897bd2bad8e5 Description: The hue metapackage Package: hue-about Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 160 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730) Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-about/hue-about_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 30994 SHA256: d906a3cae1b12dc96f6a5f4ebbc995a61c6e1eecc9764e3866f3a70287061a07 SHA1: fed61504d0556ac8289d9b3a78a8d9323f82715a MD5sum: b78b08ffe950ab2bfbe4cd0c67ed8fc9 Description: Show version and configuration information about Hue Displays the current version and configuration information about your Hue installation. Package: hue-beeswax Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 3384 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hue-jobbrowser, hue-jobsub, hue-filebrowser, hue-help, hadoop-hive (>= 0.7.0) Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-beeswax/hue-beeswax_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 1562424 SHA256: df25a4acdb1574b06922322d9ed9e40f7802251e2561f29fe25666507d92540d SHA1: 2bbac1f0809cf9cffeb219ba4039f93fac2e9b04 MD5sum: 7947a6209d927dc7dc8fcd5e74508451 Description: A UI for Hive on Hue Beeswax is a web interface for Hive. . It allows users to construct and run queries on Hive, manage tables, and import and export data. Package: hue-common Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: i386 Maintainer: bc Wong Installed-Size: 59168 Depends: python2.6, libsasl2-modules-gssapi-mit, libxslt1.1, make, python-setuptools Recommends: hadoop-0.20, hue-useradmin, hue-about, hue-help, hue-filebrowser, hue-jobbrowser, hue-jobsub, hue-beeswax, hue-proxy Conflicts: cloudera-desktop Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-common/hue-common_1.2.0.0+54-1~maverick-cdh3_i386.deb Size: 38128206 SHA256: 9059c536e3e30aa9b23c3680f6fd4a276745d07b3e1d1463fd8894bd7ac322f6 SHA1: d7df1d9178bda907c7d87418508e3b7d41ca3470 MD5sum: 1cc8f0f03f4661d74154855df7b54d58 Description: A browser-based desktop interface for Hadoop Hue is a browser-based desktop interface for interacting with Hadoop. It supports a file browser, job tracker interface, cluster health monitor, and more. Package: hue-filebrowser Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 3576 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730), hue-help Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-filebrowser/hue-filebrowser_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 991280 SHA256: 040167495077b0730b288e7945bce10115a5e6cb3de22680d470a39ea6ac45a7 SHA1: 3512465bf3c1a1d22410cdb47fa6defc3aa820d6 MD5sum: 0da8cb287d20193a38b71efbd44a80c8 Description: A UI for the Hadoop Distributed File System (HDFS) Filebrowser is a graphical web interface that lets you browse and interact with the Hadoop Distributed File System (HDFS). Package: hue-help Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 140 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730) Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-help/hue-help_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 9540 SHA256: 4727eec4ff7e72853601444fc463232b300e776a01575671d50dee8f061550a2 SHA1: 6ed10004701a4f8fd547242b2a44f374bc982316 MD5sum: e0395aee9d3c327d184682886b2c722a Description: Display help documentation for various Hue apps Displays help documentation for various Hue apps. Package: hue-jobbrowser Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 1928 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730), hue-filebrowser, hue-help Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-jobbrowser/hue-jobbrowser_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 780108 SHA256: 9907197652b747d2dfe03abce9ff564217971a28c640ac38601bebd3475deddf SHA1: 281ee325f932fd45733f354f7cbcbe4423472b97 MD5sum: 77ad1ec5b2bff52b536f52f05d74c280 Description: A UI for viewing Hadoop map-reduce jobs Jobbrowser is a web interface for viewing Hadoop map-reduce jobs running on your cluster. Package: hue-jobsub Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 2064 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730), hue-jobbrowser, hue-help Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-jobsub/hue-jobsub_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 817362 SHA256: 121fe7227965d866276830a6799b7e2791c401b2688b8263d8fa78a43c7e5ca4 SHA1: 8f609e57b8613f5c27d10807402d6a5ae808ad3c MD5sum: 082fd44945eedd83911150331ebdd620 Description: A UI for designing and submitting map-reduce jobs to Hadoop Jobsub is a web interface for designing and submitting map-reduce jobs to Hadoop. Package: hue-plugins Source: hue-common Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 1680 Depends: hadoop-0.20 (>= 0.20.2+730) Conflicts: cloudera-desktop-plugins Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-common/hue-plugins_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 1543566 SHA256: 88cddfe88bd6bd939166070bf9d44abac742630e258f388afbf6f3497feba2d8 SHA1: fdb8d13928276ec96a52c30758e5e991d7467794 MD5sum: d3fdc595b4a0eaca520664f79334f35b Description: Plug-ins for Hadoop to enable integration with Hue These plug-ins enable the Hadoop Daemons to communicate with Hue. This package must be installed on every node in the Hadoop cluster. Package: hue-proxy Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 104 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730) Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-proxy/hue-proxy_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 7250 SHA256: 55c441bae90ea8b8c6419fb9ba6e7921f9dc7a81d1d892a374f6fb6bd530c06d SHA1: fe91bb02423a0f12ee28b0dfeb0a3021896ae63d MD5sum: 76767e8e9d2029d57e3ca03bf55426da Description: Reverse proxy for the Hue server Proxies HTTP requests through the Hue server. This is intended to be used for "built-in" UIs. Package: hue-useradmin Version: 1.2.0.0+54-1~maverick-cdh3 Architecture: all Maintainer: bc Wong Installed-Size: 220 Depends: python (>= 2.4), python (<< 3), make (>= 3.8), hue-common (>= 1.2), hadoop-0.20 (>= 0.20.2+730), hue-filebrowser, hue-about, hue-help Provides: hue-user Homepage: http://github.com/cloudera/hue Priority: extra Section: misc Filename: pool/contrib/h/hue-useradmin/hue-useradmin_1.2.0.0+54-1~maverick-cdh3_all.deb Size: 34632 SHA256: c537791d056eaa29617614c963bc0ed6989359d5651b5233cb9c7fb71aa4ab8e SHA1: 8ed2c02f314f5a6e1751c2bf73c69044b1b60fb7 MD5sum: cb637ed7230af0d8b3c0e16b7f708ae0 Description: Create/delete users, update user information Create/delete Hue users, and update user information (name, email, superuser status, etc.) Package: libhdfs0 Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 300 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3), libc6 (>= 2.4) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: misc Filename: pool/contrib/h/hadoop-0.20/libhdfs0_0.20.2+923.21-1~maverick-cdh3_i386.deb Size: 240422 SHA256: 7fe8b87dc286d361dcba5da52370ce2c12c193dd6f232fb534d630083fcfef84 SHA1: 3df8b2c7c229913f0eb0e61fbfb5a43ee09e4214 MD5sum: 463f4be26c9b01e0df7abf2704dcc1c7 Description: JNI Bindings to access Hadoop HDFS from C See http://wiki.apache.org/hadoop/LibHDFS Package: libhdfs0-dev Source: hadoop-0.20 Version: 0.20.2+923.21-1~maverick-cdh3 Architecture: i386 Maintainer: Todd Lipcon Installed-Size: 292 Depends: hadoop-0.20 (= 0.20.2+923.21-1~maverick-cdh3), libhdfs0 (= 0.20.2+923.21-1~maverick-cdh3) Homepage: http://hadoop.apache.org/core/ Priority: extra Section: libdevel Filename: pool/contrib/h/hadoop-0.20/libhdfs0-dev_0.20.2+923.21-1~maverick-cdh3_i386.deb Size: 236456 SHA256: 51842700e10272b9691d26aea36a1ba87793473c5fd4e10b3d02b6b5f1d8e465 SHA1: 1abc6597eec57c8b9f4a759743e07314fd81a5ce MD5sum: 8efbd157e41d2bac11ccf3300bd15bc1 Description: Development support for libhdfs0 Includes examples and header files for accessing HDFS from C Package: oozie Version: 2.3.0+31.2-1~maverick-cdh3 Architecture: all Maintainer: Arvind Prabhakar Installed-Size: 53308 Depends: oozie-client (= 2.3.0+31.2-1~maverick-cdh3) Homepage: http://archive.cloudera.com/cdh/3/oozie Priority: extra Section: misc Filename: pool/contrib/o/oozie/oozie_2.3.0+31.2-1~maverick-cdh3_all.deb Size: 53638674 SHA256: 24075704dad74eb47df314917c90e5d696ff2ef39a3ad24f90977a086be99098 SHA1: 94b2000459b3a0974448ca480f35c75dfeefdc3b MD5sum: a2beac7abf5ec20d2dd149176884c60d Description: A workflow and coordinator sytem for Hadoop jobs. Oozie workflows are actions arranged in a control dependency DAG (Direct Acyclic Graph). Oozie coordinator functionality allows to start workflows at regular frequencies and when data becomes available in HDFS. . An Oozie workflow may contain the following types of actions nodes: map-reduce, map-reduce streaming, map-reduce pipes, pig, file-system, sub-workflows, java, hive, sqoop and ssh (deprecated). . Flow control operations within the workflow can be done using decision, fork and join nodes. Cycles in workflows are not supported. . Actions and decisions can be parameterized with job properties, actions output (i.e. Hadoop counters) and HDFS file information (file exists, file size, etc). Formal parameters are expressed in the workflow definition as variables. . A Workflow application is an HDFS directory that contains the workflow definition (an XML file), all the necessary files to run all the actions: JAR files for Map/Reduce jobs, shells for streaming Map/Reduce jobs, native libraries, Pig scripts, and other resource files. . Running workflow jobs is done via command line tools, a WebServices API or a Java API. . Monitoring the system and workflow jobs can be done via a web console, the command line tools, the WebServices API and the Java API. . Oozie is a transactional system and it has built in automatic and manual retry capabilities. . In case of workflow job failure, the workflow job can be rerun skipping previously completed actions, the workflow application can be patched before being rerun. Package: oozie-client Source: oozie Version: 2.3.0+31.2-1~maverick-cdh3 Architecture: all Maintainer: Arvind Prabhakar Installed-Size: 47264 Homepage: http://archive.cloudera.com/cdh/3/oozie Priority: extra Section: misc Filename: pool/contrib/o/oozie/oozie-client_2.3.0+31.2-1~maverick-cdh3_all.deb Size: 33807008 SHA256: 77c831feaa86eb6ad0d2f4f799324c82c46ddf260699dd92054170e9cc7e2f5b SHA1: 2cdfa20fe8e1223f2b5034d85b59797907e33ac0 MD5sum: 6ff4e164a0920a2a7742e4b5b67a862b Description: Command line utility that allows remote access and operation of oozie. Using this utility, the user can deploy workflows and perform other administrative and monitoring tasks such as start, stop, kill, resume workflows and coordinator jobs. Package: python-hive Source: hadoop-hive Version: 0.7.0+27.1-2~maverick-cdh3 Architecture: all Maintainer: Todd Lipcon Installed-Size: 964 Depends: python, python-support (>= 0.90.0) Provides: python2.6-hive Homepage: http://hadoop.apache.org/hive/ Priority: extra Section: python Filename: pool/contrib/h/hadoop-hive/python-hive_0.7.0+27.1-2~maverick-cdh3_all.deb Size: 68534 SHA256: 08af49c4865c30a1a2465cc699e307a737816f71d09d23a58e96a0afd53fed92 SHA1: 3f253ee116136981eea1552d43740a48307d5ec2 MD5sum: 7128c5cd18cc8d0e34b5822e5a58309d Description: Python client library to talk to the Hive Metastore This is a generated Thrift client to talk to the Hive Metastore. Package: sqoop Version: 1.2.0+24.2-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 2332 Depends: sun-java6-jdk, hadoop, hadoop-hbase, hadoop-zookeeper Homepage: http://www.cloudera.com Priority: extra Section: misc Filename: pool/contrib/s/sqoop/sqoop_1.2.0+24.2-1~maverick-cdh3_all.deb Size: 1135346 SHA256: 7c2649cb207e449231cf376b22e1102072e6b97b36b53e753ef4813c623b7636 SHA1: f11ab3cb4eafaff16fa081b019d4f24fc7e50539 MD5sum: 633f614776b57f762a5dee21c9db30b9 Description: Tool for easy imports and exports of data sets between databases and HDFS Sqoop is a tool that provides the ability to import and export data sets between the Hadoop Distributed File System (HDFS) and relational databases. Package: sqoop-metastore Source: sqoop Version: 1.2.0+24.2-1~maverick-cdh3 Architecture: all Maintainer: Alex Newman Installed-Size: 68 Depends: sqoop (= 1.2.0+24.2-1~maverick-cdh3), adduser Homepage: http://www.cloudera.com Priority: extra Section: misc Filename: pool/contrib/s/sqoop/sqoop-metastore_1.2.0+24.2-1~maverick-cdh3_all.deb Size: 6582 SHA256: 0829089e1bea1e70b779c6726a605bc915206dee93fb8c0367651c247dc43eea SHA1: 31e203c93d35eb91b81f155d9d25f96ee0abbcb6 MD5sum: 35440edfca0809fee11c960935006cd7 Description: Shared metadata repository for Sqoop. This optional package hosts a metadata server for Sqoop clients across a network to use. Package: whirr Version: 0.3.0+5.1-1~maverick-cdh3 Architecture: all Maintainer: Tom White Installed-Size: 46008 Depends: sun-java6-jdk Homepage: http://incubator.apache.org/whirr Priority: extra Section: misc Filename: pool/contrib/w/whirr/whirr_0.3.0+5.1-1~maverick-cdh3_all.deb Size: 38199066 SHA256: 4835307ab0cdc29d74dfa52d6ad1830093a037c9dc386b742c2fab5d9ab26646 SHA1: fa5f49785e21053c4245bf54466d48e13177b600 MD5sum: 686c8abb11ba0f7537f64fc6d3ec5ece Description: Scripts and libraries for running software services on cloud infrastructure Whirr provides . * A cloud-neutral way to run services. You don't have to worry about the idiosyncrasies of each provider. * A common service API. The details of provisioning are particular to the service. * Smart defaults for services. You can get a properly configured system running quickly, while still being able to override settings as needed.