commit 8efd9403bec17664774c7ae8fe48ce2357cb64bc
Author: Jenkins slave
Date:   Mon Sep 15 08:21:09 2014 -0700

    Preparing for CDH5.1.3 release

commit c0c81fb6f0a49b3ce01b50b6161e0e1cacce2b32
Author: Jenkins slave
Date:   Tue Aug 26 16:49:18 2014 -0700

    Preparing for CDH5.1.3 development

commit 6e5ad892eac45890cf265c853cde3945951b8168
Author: Jenkins slave
Date:   Mon Aug 25 10:05:26 2014 -0700

    Prepare for CDH5.1.2 release

commit aeaf5d741a386c56a1c293974a2732e9506f70ff
Author: Jenkins slave
Date:   Wed Jul 16 14:30:14 2014 -0700

    Preparing for CDH5.1.1 development

commit 1c83b09d511855dbebea06c63ff9eca74152cc84
Author: Jenkins slave
Date:   Fri Jul 11 10:51:27 2014 -0700

    Preparing for CDH5.1.0 release

commit d7ac47e17d532d79abd6fc27a264bf25dafbb2b0
Author: Wolfgang Hoschek
Date:   Thu Jun 26 10:51:21 2014 -0700

    better prevention of potentially gigantic error messages in readCSV

commit 48f87e63ed861d27b7a2898009982dbb01ff7386
Author: Wolfgang Hoschek
Date:   Thu Jun 26 09:53:20 2014 -0700

    prevent potentially gigantic error messages in readCSV

commit 85c840e842bb3f5af04e17cb11e66bbd96e45ebe
Author: Wolfgang Hoschek
Date:   Wed Jun 25 20:10:02 2014 -0700

    Add option to ignore (or fail on) very long records with readCSV command

commit f7ab2d8520e0d79aaff661f312ffdc6db0c1bfba
Author: Wolfgang Hoschek
Date:   Fri Jun 20 12:23:40 2014 -0700

    enhance performance of removeFields command

commit 522b52931c17215a37a75cd3714c2c504d3792ef
Author: Wolfgang Hoschek
Date:   Tue Jun 10 20:34:16 2014 -0700

    simplify constructor code

commit 71428886151c669e9ed20dcd968699a539f18643
Author: Wolfgang Hoschek
Date:   Tue Jun 10 20:29:59 2014 -0700

    remove unnecessary interface

commit 741b6735be729dcf2ce1d08ffea76ffee7bbda39
Author: Wolfgang Hoschek
Date:   Tue Jun 10 16:32:47 2014 -0700

    more performance enhancements for removeFields command

commit 0ed1dba1103a2b2ef9c1299f23b955ffa4bd4e0c
Author: Wolfgang Hoschek
Date:   Sun Jun 8 19:39:41 2014 -0700

    Also make it possible to compile with guava-16 and guava-17 (guava-16
    removed the Closeables class and guava-17 reintroduced it, but with an API that has backwards compat issues)

commit 3596bf058cbcdd02eeaf51f2dfe5614cff037b2e
Author: Wolfgang Hoschek
Date:   Sun Jun 8 17:40:58 2014 -0700

    Upgrade kite-morphlines-core module from metrics-0.3.1 to bug fix release metrics-0.3.2

commit 9242b6f1979f021f65090356313140ad08612e33
Author: Wolfgang Hoschek
Date:   Sun Jun 8 17:39:13 2014 -0700

    Upgrade kite-morphlines-maxmind module from maxmind-db-0.3.1 to bug fix release maxmind-db-0.3.3

commit 50d6dc21c9eea6a532fb8018e739419c195e898c
Author: Wolfgang Hoschek
Date:   Thu Jun 5 15:45:09 2014 -0700

    performance enhancement for removeFields command

commit 1851d441bc0c2653b1a7ec3d70919ab33a971355
Author: Wolfgang Hoschek
Date:   Wed Jun 4 12:11:30 2014 -0700

    reduce visibility of internal classes

commit c69216ad5c634aed171bbc105e016f7bef74b72e
Author: Wolfgang Hoschek
Date:   Tue Jun 3 18:40:53 2014 -0700

    add more tests for removeFields command

commit 5de73ebf84357ddc53ff458a2ba89c7b7e4edbec
Author: Wolfgang Hoschek
Date:   Tue Jun 3 18:05:07 2014 -0700

    add test for multi-valued field

commit 25155e87d7e96232aa25df1dfe838c4131b11b4a
Author: Wolfgang Hoschek
Date:   Tue Jun 3 17:59:41 2014 -0700

    performance enhancement for removeFields command

commit 27831fce3e9c998182afde2587bc8a397608d403
Author: Wolfgang Hoschek
Date:   Tue Jun 3 16:50:08 2014 -0700

    enhance doc for removeFields command

commit d243ba5d8cb76f604e4cafe4e63b182f61f45e21
Author: Wolfgang Hoschek
Date:   Tue Jun 3 16:47:28 2014 -0700

    use better parameter names for the new removeFields command

commit a67de9c24597387ac4cbeac2c57794f63be3b9b8
Author: Wolfgang Hoschek
Date:   Tue Jun 3 16:36:15 2014 -0700

    enhance doc for removeFields command

commit e8356e41e65402e6b76f01276c1fb8c26e1cdb1c
Author: Wolfgang Hoschek
Date:   Tue Jun 3 17:06:51 2014 -0700

    CDK-454: Add morphline command that removes all record fields for which the field name matches a blacklist but not a whitelist

commit c074d04f5d945e493fc646b00c6e9c8db8769ecf
Author: Tom White
Date:   Mon Jun 2 14:41:24 2014 +0100

    CDK-323: Discard empty files.

    This fixes CDK-323 by tracking how many events are written and discarding 0-event files.

commit 9d45c071dab2f58f141eb7ba922efd9dc92525fc
Author: Gregory Chanan
Date:   Thu May 29 18:40:37 2014 -0700

    Give SolrLocator the ability to handle managed schemas.

commit 4f7201e35b6203b7adf55882b71f59e2cd47160c
Author: Tom White
Date:   Thu May 1 16:36:44 2014 +0100

    CDK-410. SchemaValidationUtil#hasErrors recursion bug.

commit b59665d54bf26786ebcb70fff8e9dd5716f66539
Author: Ryan Blue
Date:   Wed May 7 13:36:24 2014 -0700

    CDK-415: Add reconnect and retry support for Hive.

    This adds a doWithRetry() method to the internal HCatalog class that will run an action and catch TExceptions. If a TException is thrown, it will reconnect to Hive and then retry the action. Because all actions go through this method, the synchronization on the Hive client is there as well.

    Most of the HCatalog methods caught Exception and RuntimeException. This commit updates the error handling to catch the correct Hive exceptions and handle them more correctly in some cases. The fallback is the same behavior as before when a TException is unknown.

    Conflicts:
        kite-data/kite-data-hcatalog/src/main/java/org/kitesdk/data/hcatalog/HCatalog.java

commit d2b487c190f7c497dde6fd6ea28cf6414651ab39
Author: Ryan Blue
Date:   Tue Mar 18 11:16:14 2014 -0700

    CDK-292: Fix avro schema comparison bug.

    This adds a check that the number of sub-fields in a record matches. Otherwise, when fields are added at the end of the record, equals may return incorrectly depending on which schema is the left operand.

commit 5aea3dd42e7df5fca57a2798ad6e1cb6cc03c47a
Author: Karel Vervaeke
Date:   Fri Mar 7 10:47:39 2014 +0100

    Don't wrap RuntimeException in RuntimeException

commit 8e235e93fbd4ea95215d3f3fe7379f98bfa66579
Author: Karel Vervaeke
Date:   Thu Mar 6 15:59:51 2014 +0100

    Avoid concurrent access on thrift client.
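A minimal sketch of the reconnect-and-retry pattern that the CDK-415 commit message above describes. This is not the Kite HCatalog code: `RetryingClient`, `TransportException` (standing in for Thrift's TException), the `reconnect` hook, and `maxAttempts` are hypothetical names chosen for illustration; only the `doWithRetry` name comes from the commit message.

```java
/** Hypothetical sketch of a reconnect-and-retry wrapper; not the actual Kite code. */
class RetryingClient {

  /** Stand-in for a transport-level failure such as Thrift's TException (assumption). */
  public static class TransportException extends Exception {
    public TransportException(String message) {
      super(message);
    }
  }

  /** An action against the remote service that may fail with a transport error. */
  public interface Action<T> {
    T call() throws TransportException;
  }

  private final Runnable reconnect; // hypothetical hook that re-opens the connection
  private final int maxAttempts;    // total attempts, including the first one

  public RetryingClient(Runnable reconnect, int maxAttempts) {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be at least 1");
    }
    this.reconnect = reconnect;
    this.maxAttempts = maxAttempts;
  }

  /**
   * Runs the action. On a transport error it reconnects and tries again,
   * up to maxAttempts times. The method is synchronized so that only one
   * thread uses the underlying client at a time.
   */
  public synchronized <T> T doWithRetry(Action<T> action) {
    TransportException last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return action.call();
      } catch (TransportException e) {
        last = e;
        if (attempt < maxAttempts) {
          reconnect.run(); // re-establish the connection before the next attempt
        }
      }
    }
    throw new RuntimeException("Action failed after " + maxAttempts + " attempts", last);
  }
}
```

Routing every call through one synchronized method is what lets a single wrapper provide both the retry behavior and the exclusive access to the client that the commit message mentions.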
commit 871cbd16c5d167b9595adcdf6f1d1900ebe1783d
Author: Wolfgang Hoschek
Date:   Tue Apr 15 00:24:40 2014 +0300

    CDK-397: Add openHdfsFile morphline command that opens an HDFS file for read and returns a corresponding Java InputStream

commit 7631b8ff576c181ccd3fbf1aee32c0bb5ec1339d
Author: Wolfgang Hoschek
Date:   Tue Apr 15 00:21:46 2014 +0300

    CDK-396: Add readBlob morphline command that converts an InputStream to a byte array in main memory

commit 621a3a75ee0c5738ab75821d528dcfcf417b97d0
Author: Wolfgang Hoschek
Date:   Wed Mar 26 10:36:56 2014 -0700

    CDK-383: Upgrade kite-morphlines-saxon module from Saxon-HE-9.5.1-4 to Saxon-HE-9.5.1-5

commit 7e255487875a8c98a978a80bfbd83459b16b00ef
Author: Wolfgang Hoschek
Date:   Sun Mar 23 22:06:02 2014 -0700

    performance enhancement

commit 763892083817c9db47444dc4d4114883154bf23d
Author: Wolfgang Hoschek
Date:   Sat Mar 22 18:09:26 2014 -0700

    Enable headless mode during unit tests. There is no need to start AWT. This works around a thread leak during unit tests on OSX.

commit 62019d17ded4465a34eed2c1173729f3b44e55e2
Author: Wolfgang Hoschek
Date:   Sat Mar 22 16:40:53 2014 -0700

    make it compile with java8

commit 07d4b21090d9fafbc36a8710f19aca882f8d91bb
Author: E. Sammer
Date:   Sun Jan 26 13:51:17 2014 -0800

    CDK-287: Grok command should support loading dictionaries from classpath resources.

    Additionally added some minor debug-level logging to indicate which dictionaries are being loaded, and from where. Grok command docs have also been updated.
commit 0530560a65d272c5c028df3e08d7ef08c0426bfb
Author: Wolfgang Hoschek
Date:   Wed Mar 12 15:57:34 2014 -0700

    minor performance enhancements

commit fe79c8f6b83023a0f0c5fadc61b6777ab137f2ff
Author: Wolfgang Hoschek
Date:   Tue Mar 11 14:01:30 2014 -0700

    cleanup visibility

commit 159b151ddbc0305886a77cd77805b173ccf8acc8
Author: Wolfgang Hoschek
Date:   Mon Mar 10 15:15:20 2014 -0700

    performance enhancement

commit 10eebc731b4998c47d50464b3e678c61da032982
Author: Wolfgang Hoschek
Date:   Thu Mar 6 18:54:34 2014 -0800

    renamed new "destination" param to "outputField" param to be consistent with other existing commands

commit cedddf57ecfc7aab8c482556186a01b51e76a2dc
Author: Bruno Mahé
Date:   Thu Mar 6 17:46:43 2014 -0800

    Add ability to readClob to store lines to a specific field

commit e37b887fff1a572ef1d49707049c7a9195f07beb
Author: Wolfgang Hoschek
Date:   Tue Mar 4 11:25:38 2014 -0800

    improve code readability

commit fbf0f49766c1d7543808432a462c09189b69e44f
Author: Wolfgang Hoschek
Date:   Mon Mar 3 14:44:53 2014 -0800

    better log message

commit 78d59f9155e0e15f9ec41537fb8d4d6e20db293a
Author: Jenkins slave
Date:   Mon Mar 3 11:02:38 2014 -0800

    Preparing for CDH5.1.0 development

commit b36debb820f9e344d8f4d51650c555be7da554a3
Author: Ryan Blue
Date:   Wed Feb 12 12:50:59 2014 -0800

    CDK-317: Fix HCatalog URIs.

    This updates URIPattern to catch the case where a URI pattern does not have a path but the tested URI does, and fail the URI. This allows the HCatalog URIs to separately handle URIs without a path (Managed) and URIs with a path (External) while still allowing the user to override the authority information, host/port.

    Previously, the HCatalog URIs used the root path, /, to signal that the URI was for a managed HCatalog repository rather than an external one. This is no longer necessary because the update to URIPattern and any non-null path now returns an external repo.
    Conflicts:
        kite-data/kite-data-hcatalog/src/main/java/org/kitesdk/data/hcatalog/HCatalogDatasetRepository.java
        kite-data/kite-data-hcatalog/src/main/java/org/kitesdk/data/hcatalog/HCatalogExternalDatasetRepository.java
        kite-data/kite-data-hcatalog/src/test/java/org/kitesdk/data/hcatalog/TestHiveURIs.java

commit 37ae4035e48919799d5364632f8b09eacd5f748e
Author: Micah Whitacre
Date:   Wed Jan 29 13:38:19 2014 -0600

    CDK-294: Default the action specific NN and JT to Oozie Properties

commit 36c2103b8b0194fd659ff1a2242a2ce47f2cf03b
Author: Wolfgang Hoschek
Date:   Mon Feb 10 19:17:04 2014 +0200

    CDK-318: Add a sampling morphline command that forwards each input record with a given probability to its child, and silently ignores all other input records

    CDK-319: Add morphline command that ignores all input records beyond the N-th record, thus emitting at most N records, akin to the Unix head command

commit 0fdb90ea4154b30ba8638648ac88f3542cae3b86
Author: Wolfgang Hoschek
Date:   Sat Feb 8 22:52:13 2014 +0200

    CDK-315: Improve morphline import performance if all commands are specified via fully qualified class names

commit 108cdd14dbcb9ed68a7dbb77123f08cb34f17181
Author: Wolfgang Hoschek
Date:   Fri Feb 7 09:32:20 2014 +0200

    expand unit test

commit 75da50e5fb49535a9ee963588a4c8dff1f0285d8
Author: Wolfgang Hoschek
Date:   Thu Feb 6 19:14:33 2014 +0200

    CDK-310: addValuesIfAbsent morphline command should avoid performance degradation for large N

commit de456601ef6eaaa32a6fb3384bf09adff8f2f978
Author: Wolfgang Hoschek
Date:   Wed Feb 5 17:27:03 2014 +0200

    optimize field expressions by separating compilation and evaluation phase

commit 9ef10c8ea118d722ef54883536005cdf7fb22357
Author: Wolfgang Hoschek
Date:   Wed Feb 5 17:26:45 2014 +0200

    optimize field expressions by separating compilation and evaluation phase

commit b9e0bb4d228e67acc129828641bc59b8d73f0609
Author: Wolfgang Hoschek
Date:   Wed Feb 5 13:53:38 2014 +0200

    regex performance enhancement

commit 50813f296917079b9c43dd9d0f4fd57d118eaa01
Author: Wolfgang Hoschek
Date:   Tue Feb 4 23:05:37 2014 +0200

    minor readability improvement

commit a874cebbe469dd3db7cf86adaac72860a552abbd
Author: Wolfgang Hoschek
Date:   Tue Feb 4 23:08:22 2014 +0200

    CDK-305: splitKeyValue morphline command should also support multi-character separators as well as regex separators

commit 7cfeda81706e1239ea6bf5c32fe721145d47a389
Author: Wolfgang Hoschek
Date:   Fri Jan 31 23:45:49 2014 +0200

    CDK-300: SolrLocator should delete the tmp dir that it creates as soon as possible, not just on VM exit

commit 88d35366f44403a757bd2eb44b6e06babe028aa9
Author: Wolfgang Hoschek
Date:   Fri Jan 31 23:24:22 2014 +0200

    CDK-300: SolrLocator should delete the tmp dir that it creates as soon as possible, not just on VM exit

commit d006bc13f60a08e0b07b63024c0a537fa3bbc191
Author: Wolfgang Hoschek
Date:   Tue Jan 28 03:16:54 2014 +0200

    CDK-290: solrCell morphline command fails with Tika RFC822Parser if log4j TRACE mode is enabled and multiple MIME parts are contained in the message

commit 2e24f9939089d8cebe5489a1e1f023c36ac9b265
Author: Wolfgang Hoschek
Date:   Thu Jan 23 17:02:47 2014 +0200

    reducing visibility of internal shaded classes

commit 710dd3a8955205d47538ef1d001e75017fb0d7f3
Author: Wolfgang Hoschek
Date:   Thu Jan 23 16:59:13 2014 +0200

    reducing visibility of internal shaded classes

commit 3e2aca5cece6ece58b81daaa6564383b833c8fb3
Author: Wolfgang Hoschek
Date:   Thu Jan 23 16:21:07 2014 +0200

    remove dead code

commit 3bda431c89e8473a712bc7e6af6ee53856aa1513
Author: Wolfgang Hoschek
Date:   Thu Jan 23 15:08:48 2014 +0200

    fix doc typo

commit 1b84f7d6ac441866f4d297e4a27e35e87f832e2e
Author: Wolfgang Hoschek
Date:   Thu Jan 23 21:07:10 2014 +0200

    CDK-286: Upgrade kite-morphlines-saxon module from Saxon-HE-9.5.1-3 to Saxon-HE-9.5.1-4

commit 44051bd3b176973d21c4b2a7632290c3ea391bea
Author: Ryan Blue
Date:   Mon Jan 20 15:37:57 2014 -0800

    CDK-284: Fix Hive partition creation.
    This commit updates the hive module to create partitions using HiveMetaStoreClient#appendPartition rather than manually creating a Partition object. The update fixes partition creation because the location and partition values are correctly set.

    In addition, this now catches and ignores AlreadyExistsException. If the partition exists, it is okay.

commit e120e18b3d1095a8ed493d301febd07df5a2533f
Author: Wolfgang Hoschek
Date:   Tue Jan 21 01:57:26 2014 +0200

    remove obsolete imports

commit cfd77c993300ce06708e88c7e855d74bb55f08dc
Author: Wolfgang Hoschek
Date:   Tue Jan 21 01:52:42 2014 +0200

    CDK-285: Remove auxiliary dependency of kite-morphlines-hadoop-core on commons-io

commit 4013cf61afd3274efbf27512a4674419749b00d6
Author: Wolfgang Hoschek
Date:   Tue Jan 21 01:05:54 2014 +0200

    CDH-16886: Add option to specify reader schema for readAvroParquetFile morphline command

commit 3a727c299344ba1a0865ef2f339a408d061893cf
Author: Wolfgang Hoschek
Date:   Fri Jan 17 16:49:03 2014 +0200

    CDK-282: readRCFile command is broken

commit 26a34bd0032badcfe7c66c5e348fc612424a15be
Author: Wolfgang Hoschek
Date:   Fri Jan 17 01:32:58 2014 +0200

    rename readerSchema to projectionSchema in (not yet released) readAvroParquetFile command. This harmonizes the name with upstream parquet-avro module and avoids unnecessary confusion.

commit be70899054de257c0e7eca8f4643ad1dab1b4470
Author: Ryan Blue
Date:   Wed Jan 15 14:53:58 2014 -0800

    CDK-280: Remove kite-flume-avro-event-serializer.

    This has been moved upstream and is generally available.

commit a406b1f96fe9af7276feffede14da71d0b363fb6
Author: Ryan Blue
Date:   Tue Jan 7 11:37:50 2014 -0800

    CDK-249: Update for recommendations on PR#8.

    This alters the partitionAdded signature to use a String rather than a StorageKey. This removes the need to alter StorageKey and to expose the dirnameForValue method through the Accessor. In addition, the partition logic is kept in FileSystemDataset and PartitionedDatasetWriter rather than in FileSystemWriters.
    When calling partitionAdded, the implementations now pass the String path component that was actually used to create the writer.

commit 6a89d7a46af47d04ed0e104af2c05e529cc6bf52
Author: Tom White
Date:   Mon Jan 6 14:53:33 2014 +0000

    CDK-249. Hive partitions are not created in the metastore.

commit c7b4617873e81382e1060a23816d1282cdc5c84e
Author: Ryan Blue
Date:   Mon Dec 30 11:59:27 2013 -0800

    CDK-266: Use random UUIDs for unique file names.

commit 79de856e24285d80326b4d3a12d26ef1c1e06a34
Author: Ryan Blue
Date:   Wed Jan 15 09:24:35 2014 -0800

    CLOUDERA-BUILD. CDH-14681: disable flume modules.

    This avoids a circular dependency between flume and Kite.

commit 6132d875fd64bb79dfb48ed4a80954cf0f6e37a0
Author: Wolfgang Hoschek
Date:   Wed Jan 8 21:23:29 2014 +0200

    Upgrade kite-morphlines-metrics-servlets module from jetty-8.1.13.v20130916 to jetty-8.1.14.v20131031

commit f725d6e5724b8ee1fa1680dd8a9bf40e0d7926de
Author: Wolfgang Hoschek
Date:   Wed Jan 8 21:02:39 2014 +0200

    CDH-16616: MapReduceIndexerTool and SolrSink and HBaseMapReduceIndexerTool should fail fast if zk does not contain solrconfig.xml

commit 8fda847be8fe72225bf17ce47c92c638bb98a70d
Author: Wolfgang Hoschek
Date:   Wed Jan 8 14:13:58 2014 +0200

    Upgrade kite-morphlines-metrics-servlets module from jetty-8.1.13.v20130916 to jetty-8.1.14.v20131031

commit a8f460fe6a3d3657f1c7e7dfeec26566e82b9ee3
Author: Wolfgang Hoschek
Date:   Tue Jan 7 13:46:52 2014 +0200

    add log message for better diagnostics

commit 3f6e59a53650e422679d3919bd2b293aaa5588d0
Author: Tom White
Date:   Wed Jan 8 14:44:02 2014 +0000

    CDK-275. Upgrade to Crunch 0.9.0.

commit 4154a28c89f4ff098704912dbad1cdadf1e5d3eb
Author: Ryan Blue
Date:   Mon Jan 6 12:33:38 2014 -0800

    CLOUDERA-BUILD. Restore vers.crunch-base to fix warnings.

commit 807a977d0dbc237c3be3c8dfd9deb78cd64f83b6
Author: Ryan Blue
Date:   Mon Jan 6 11:50:53 2014 -0800

    CLOUDERA-BUILD. Update crunch dependency for CDH5.
commit 5b6743019e75fc212894e7382ed8e13ac0a3624f
Author: jh016266
Date:   Thu Jan 2 19:00:51 2014 -0600

    Fix typo in Notifications class

commit b5e49e541b289fbf246ecd089dc89bb37ba1a11f
Author: Wolfgang Hoschek
Date:   Fri Jan 3 14:34:08 2014 +0200

    force hadoop-client to be "provided" for parquet

commit b7083aec356ea703fd7f21ddd37122bbe56fc347
Author: Wolfgang Hoschek
Date:   Fri Jan 3 00:50:34 2014 +0200

    force hadoop-client to be "provided" for parquet

commit ac46da0c21a346af3a254f7342b0de43565452d3
Author: Wolfgang Hoschek
Date:   Thu Jan 2 21:53:39 2014 +0200

    CDK-271: Add "addEmptyStrings" parameter to readCSV morphline command to indicate whether or not to add zero length strings to the output field.

commit e281642073a89664473b3e58e71d36809f657ffd
Author: Wolfgang Hoschek
Date:   Mon Dec 30 23:17:39 2013 +0200

    allow passing explicit Path objects to readAvroParquetFile command

commit 66156fdfa310284401fba5c2167479830ccf3e1a
Author: Wolfgang Hoschek
Date:   Mon Dec 30 23:16:37 2013 +0200

    CDK-267: Add morphline command that parses an Avro Parquet file

commit c050e65a3e2f894ab386a2458415042e4aa23aef
Author: Andrew Bayer
Date:   Thu Jan 2 11:43:57 2014 -0800

    CLOUDERA-BUILD. Fixing kite-morphlines-hadoop-parquet-avro parent POM version

commit f932ce6c94eb39790781cd6715a0b55a5f2e30af
Author: Wolfgang Hoschek
Date:   Thu Jan 2 21:33:28 2014 +0200

    CDK-267: Add morphline command that parses an Avro Parquet file

commit c6ba527106c34b9a149963159f4d71d065af22f7
Author: Wolfgang Hoschek
Date:   Tue Dec 31 01:13:34 2013 +0200

    CDK-263: Upgrade kite-morphlines-json module from jackson-databind-2.2.1 to jackson-databind-2.3.1

commit f4494f7846df46ce4c7479ace7b36c1f9bb9959c
Author: Wolfgang Hoschek
Date:   Tue Dec 31 01:13:19 2013 +0200

    CDK-263: Upgrade kite-morphlines-json module from jackson-databind-2.2.1 to jackson-databind-2.3.1

commit 4df45e1a3de4e6a4babd6cf0cbc541da0b259552
Author: Wolfgang Hoschek
Date:   Tue Dec 31 01:09:53 2013 +0200

    Revert "CDK-263: Upgrade tika-morphlines-json module from jackson-databind-2.2.1 to jackson-databind-2.3.1"

    This reverts commit 9f25dffeecafd547db3fe4817fffe822bc6c057b.

commit 9f25dffeecafd547db3fe4817fffe822bc6c057b
Author: Wolfgang Hoschek
Date:   Tue Dec 31 01:07:31 2013 +0200

    CDK-263: Upgrade tika-morphlines-json module from jackson-databind-2.2.1 to jackson-databind-2.3.1

commit 17ba192759324847855d0cc5c40ea4d10268f275
Author: Wolfgang Hoschek
Date:   Fri Dec 27 19:43:52 2013 +0200

    fix typo

commit da0882d066902246d63dedefcce65e559096673b
Author: Ryan Blue
Date:   Fri Dec 20 18:31:02 2013 -0800

    DK-260: Revert change to TestMetadataProviders.

commit 7ecef3ad68abcd0c443e93d900d22c57d72c0aac
Author: Ryan Blue
Date:   Fri Dec 20 16:10:31 2013 -0800

    CDK-260: Fix PartitionExpression for DateFormatPartitioner.

    This moves the toExpression(FieldPartitioner) method into the PartitionFunctions class. This allows accessor methods to be package-private and puts the expression logic for both directions in one place to avoid oversight.
commit 82281bc72a068bde969afca77b7c7c7f84a314f7
Author: Wolfgang Hoschek
Date:   Thu Dec 26 19:11:45 2013 +0200

    CDK-263: Upgrade cdk-morphlines-json module from jackson-databind-2.2.1 to jackson-databind-2.3.0

commit 24b8bd217a225b58ebe07a038ddb1a7848ffcda4
Author: Wolfgang Hoschek
Date:   Thu Dec 26 19:11:00 2013 +0200

    CDK-263: Upgrade cdk-morphlines-json module from jackson-databind-2.2.1 to jackson-databind-2.3.0

commit 1d7cd0bfb3d0d4277662692b288aa39a625960d3
Author: Wolfgang Hoschek
Date:   Wed Dec 25 22:25:23 2013 +0200

    CDK-262: Tika parsers used by solrCell morphline command should not depend on the default Java locale

commit c10768329bd74210ca35b49e80eeec1baab3e913
Author: Wolfgang Hoschek
Date:   Wed Dec 25 21:45:52 2013 +0200

    organize imports

commit d5d62fe331a9959b1a5ad9e63283c1afd4d7db88
Author: Wolfgang Hoschek
Date:   Wed Dec 25 21:34:44 2013 +0200

    disable date related test because *.xlsx solr cell parser is locale sensitive; other solr cell parsers appear not to be locale sensitive, after all.
commit 2a99379b909e224893d6ff1386c9ec0d377ae27e
Author: Wolfgang Hoschek
Date:   Tue Dec 24 23:57:01 2013 +0200

    add cache hit/miss metrics to userAgent morphline command

commit a21ed4128f96887abf4051b9abc25d1560b2568d
Author: Wolfgang Hoschek
Date:   Tue Dec 24 23:19:26 2013 +0200

    - use windows path conventions on windows
    - don't assume Java default locale for date conversions

commit cee13d4fab977b5e7b978a1b5323e5f35e2df4cb
Author: Wolfgang Hoschek
Date:   Tue Dec 17 14:19:30 2013 -0800

    use better variable name

commit 8261eb38f7f9804915152a0cbeef3d52e641a0fe
Author: Wolfgang Hoschek
Date:   Tue Dec 17 14:03:52 2013 -0800

    Add more logging for better diagnostics & debugging

commit 43032c0e287461386284e3f83bf8460922d8c21f
Author: Wolfgang Hoschek
Date:   Sun Dec 15 11:41:30 2013 -0800

    fix broken doc links

commit f49759a27e76ad9f8b409016c7e119fc6c8689a1
Author: Wolfgang Hoschek
Date:   Sun Dec 15 11:31:14 2013 -0800

    make sanity null checks more explicit

commit 2d68751fb8bcd066eeef8641d4696ea3aacdfb4c
Author: Wolfgang Hoschek
Date:   Fri Dec 13 09:43:48 2013 -0800

    CDK-256: update the version numbers in the "Usage in your Maven Project" snippets from 0.8.1 to 0.10.0 in the kite-morphlines docs

commit 280a882caf7b121c400d17a900cdc48027de5753
Author: Wolfgang Hoschek
Date:   Tue Dec 10 14:05:11 2013 -0800

    fix some doc issues

commit 0e5e1e4446142be7d31562eadee54f08d0f08f02
Author: Ryan Blue
Date:   Thu Dec 19 11:33:31 2013 -0800

    CLOUDERA-BUILD. Enable kite-data-hbase.

commit 3e9bc949e96327380b3f2fc2e373a2343bdfc00a
Author: Ryan Blue
Date:   Thu Dec 19 11:21:34 2013 -0800

    CLOUDERA-BUILD. Fix kite-data-hbase tests.

    Cherry-picked from da61b08.

    HBase 0.96.0 puts now require a >0 length key byte array. We used a special 0 length byte array for the version record whose put values were merged into an entity put for an actual row, so had to change this to a byte array of length 1.
commit 2625e03471237418631baa8cdb65e59be5597611
Author: Adam Warrington
Date:   Tue Dec 17 21:10:01 2013 -0500

    CLOUDERA-BUILD. Fix kite-data-hbase test deps.

    Cherry-picked from da61b08.

    This commit fixes the kite-data-hbase tests. These were failing due to the HBase mini cluster code depending on some hbase-hadoop-compat classes. Added the hbase-hadoop-compat and hbase-hadoop2-compat packages as test dependencies to fix.

commit 60e78c4bb231df326cd119ed00ca35a067bde913
Author: Tom White
Date:   Mon Dec 16 16:11:07 2013 +0000

    CLOUDERA-BUILD. Use common CDH version of Parquet.

commit 9e0955b530fedc0db433dfa87f44fa1d34fd771d
Author: Ryan Blue
Date:   Wed Dec 11 17:57:03 2013 -0800

    Fix solr dependency

commit e75b275f4d0f2acd25f4547dd7be1c29bcb61a0d
Author: Ryan Blue
Date:   Wed Dec 11 17:46:12 2013 -0800

    Update log4j version to match CDH

commit ddebb29d3924af8b19176fab374cb009c2dc1ec6
Author: Ryan Blue
Date:   Wed Dec 11 17:44:51 2013 -0800

    Set slf4j version to cdh.slf4j.version.

commit f3f413508161d46df3f89e510a0f2dfa990263e2
Author: Ryan Blue
Date:   Wed Dec 11 17:42:44 2013 -0800

    Fix expected solr version.

commit f13540043e53a71d74ff34d9de0c3675ad8bde23
Author: Ryan Blue
Date:   Wed Dec 11 17:34:43 2013 -0800

    Disable kite-data-maven in the build.

    I've updated the code so it compiles, but it is failing tests with a zookeeper issue related to upstream test util refactoring. We need to get morphlines out for search, so this is disabled for now.

commit dd8ca3e27aefc273ca465e6fed1de6596b1d2566
Author: Ryan Blue
Date:   Wed Dec 11 16:47:35 2013 -0800

    Fix Hive TestSchemaConversion for avro 1.7.4.

commit 9f3f11c12723b4196ed4e028f23e5bca02b7d530
Author: Ryan Blue
Date:   Wed Dec 11 16:33:58 2013 -0800

    Fix hcatalog dependencies for 0.12.0.

    HCatalog is now versioned as part of hive.

commit b67d27be05573e8949876e8aa8d372a52f18e003
Author: Ryan Blue
Date:   Wed Dec 11 16:16:25 2013 -0800

    Remove references to missing SchemaBuilder.

    Avro is currently 1.7.4 instead of 1.7.5.
    This should be a temporary change.

commit 8ef8439453824234773d29d5ec7f9f0bf0baa00f
Author: Ryan Blue
Date:   Wed Dec 11 15:52:42 2013 -0800

    Update POM for C5 build.