CDH 5.13.0 Release Notes
The following lists all Apache Hive Jiras included in CDH 5.13.0
that are not included in the Apache Hive base version 1.1.0. The
hive-1.1.0-cdh5.13.0.CHANGES.txt
file lists all changes included in CDH 5.13.0. The patch for each
change can be found in the cloudera/patches directory in the release tarball.
Changes Not In Apache Hive 1.1.0
Common
Bug
- [HADOOP-13867] - FilterFileSystem should override rename(.., options) to take effect of Rename options called via FilterFileSystem implementations
Parquet
Bug
- [PARQUET-26] - Parquet doesn't recognize the nested Array type in MAP as ArrayWritable.
HDFS
Bug
- [HDFS-8312] - Trash does not descent into child directories to check for permissions
Hive
Bug
- [HIVE-17368] - DBTokenStore fails to connect in Kerberos enabled remote HMS environment
- [HIVE-17225] - HoS DPP pruning sink ops can target parallel work objects
- [HIVE-17405] - HoS DPP ConstantPropagate should use ConstantPropagateOption.SHORTCUT
- [HIVE-17216] - Additional qtests for HoS DPP
- [HIVE-11193] - ConstantPropagateProcCtx should use a Set instead of a List to hold operators to be deleted
- [HIVE-16949] - Leak of threads from Get-Input-Paths and Get-Input-Summary thread pool
- [HIVE-13514] - TestClearDanglingScratchDir fail on branch-1
- [HIVE-15233] - UDF UUID() should be non-deterministic
- [HIVE-17268] - WebUI / QueryPlan: query plan is sometimes null when explain output conf is on
- [HIVE-16982] - WebUI "Show Query" tab prints "UNKNOWN" instead of explaining configuration option
- [HIVE-17008] - Fix boolean flag switchup in DropTableEvent
- [HIVE-15208] - Query string should be HTML encoded for Web UI
- [HIVE-6727] - Table level stats for external tables are set incorrectly
- [HIVE-16978] - HoS: add current thread ID to the log redirector for the RemoteDriver
- [HIVE-16675] - Fix ConcurrentModificationException in SparkClientImpl#startDriver
- [HIVE-16845] - INSERT OVERWRITE a table with dynamic partitions on S3 fails with NPE
- [HIVE-17247] - HoS DPP: UDFs on the partition column side does not evaluate correctly
- [HIVE-12473] - DPP: UDFs on the partition column side does not evaluate correctly
- [HIVE-16315] - Describe table doesn't show num of partitions
- [HIVE-12772] - Beeline/JDBC output of decimal values is not 0-padded, does not match with CLI output
- [HIVE-11462] - GenericUDFStruct should constant fold at compile time
- [HIVE-16998] - Add config to enable HoS DPP only for map-joins
- [HIVE-17050] - Multiline queries that have comment in middle fail when executed via "beeline -e"
- [HIVE-17087] - Remove unnecessary HoS DPP trees during map-join conversion
- [HIVE-12762] - Common join on parquet tables returns incorrect result when hive.optimize.index.filter set to true
- [HIVE-12551] - Fix several kryo exceptions in branch-1
- [HIVE-17052] - Remove logging of predicate filters
- [HIVE-16961] - Hive on Spark leaks spark application in case user cancels query and closes session
- [HIVE-10616] - TypeInfoUtils doesn't handle DECIMAL with just precision specified
- [HIVE-17150] - CREATE INDEX execute HMS out-of-transaction listener calls inside a transaction
- [HIVE-17149] - Hdfs directory is not cleared if partition creation failed on HMS
- [HIVE-11592] - ORC metadata section can sometimes exceed protobuf message size limit
- [HIVE-16960] - Hive throws an ugly error exception when HDFS sticky bit is set
- [HIVE-16183] - Fix potential thread safety issues with static variables
- [HIVE-10209] - FetchTask with VC may fail because ExecMapper.done is true
- [HIVE-16930] - HoS should verify the value of Kerberos principal and keytab file before adding them to spark-submit command parameters
- [HIVE-15428] - HoS DPP doesn't remove cyclic dependency
- [HIVE-13588] - NPE is thrown from MapredLocalTask.executeInChildVM
- [HIVE-16935] - Hive should strip comments from input before choosing which CommandProcessor to run.
- [HIVE-13330] - ORC vectorized string dictionary reader does not differentiate null vs empty string dictionary
- [HIVE-13947] - HoS print wrong number for hash table size in map join scenario
- [HIVE-16875] - Query against view with partitioned child on HoS fails with privilege exception.
- [HIVE-16559] - Parquet schema evolution for partitioned tables may break if table and partition serdes differ
- [HIVE-16869] - Hive returns wrong result when predicates on non-existing columns are pushed down to Parquet reader
- [HIVE-16291] - Hive fails when unions a parquet table with itself
- [HIVE-16450] - Some metastore operations are not retried even with desired underlining exceptions
- [HIVE-15122] - Hive: Upcasting types should not obscure stats (min/max/ndv)
- [HIVE-14178] - Hive::needsToCopy should reuse FileUtils::equalsFileSystem
- [HIVE-11878] - ClassNotFoundException can possibly occur if multiple jars are registered one at a time in Hive
- [HIVE-16665] - Race condition in Utilities.GetInputPathsCallable --> createDummyFileForEmptyPartition
- [HIVE-16613] - SaslClientHandler.sendHello is eating exceptions
- [HIVE-9567] - JSON SerDe not escaping special chars when writing char/varchar data
- [HIVE-13390] - HiveServer2: Add more test to ZK service discovery using MiniHS2
- [HIVE-14564] - Column Pruning generates out of order columns in SelectOperator which cause ArrayIndexOutOfBoundsException.
- [HIVE-16693] - beeline "source" command freezes if you have a comment in it?
- [HIVE-16468] - BeeLineDriver should be able to run tests against an externally created cluster
- [HIVE-16593] - SparkClientFactory.stop may prevent JVM from exiting
- [HIVE-16389] - Allow HookContext to access SQLOperationDisplay
- [HIVE-12767] - Implement table property to address Parquet int96 timestamp bug
- [HIVE-16469] - Parquet timestamp table property is not always taken into account
- [HIVE-16660] - Not able to add partition for views in hive when sentry is enabled
- [HIVE-15483] - Database and table name is case sensitive when used in show grant
- [HIVE-16413] - Create table as select does not check ownership of the location
- [HIVE-16584] - Warning messages should use LogHelper.printInfo instead of printing to the infoStream directly
- [HIVE-16308] - PreExecutePrinter and PostExecutePrinter should log to INFO level instead of ERROR
- [HIVE-16363] - QueryLifeTimeHooks should catch parse exceptions
- [HIVE-15900] - Beeline prints tez job progress in stdout instead of stderr
- [HIVE-16646] - Alias in transform ... as clause shouldn't be case sensitive
- [HIVE-16459] - Forward channelInactive to RpcDispatcher
- [HIVE-16451] - Race condition between HiveStatement.getQueryLog and HiveStatement.runAsyncOnServer
- [HIVE-14380] - Queries on tables with remote HDFS paths fail in "encryption" checks.
- [HIVE-15867] - Add blobstore tests for import/export
- [HIVE-16487] - Serious Zookeeper exception is logged when a race condition happens
- [HIVE-12065] - FS stats collection may generate incorrect stats for multi-insert query
- [HIVE-16060] - GenericUDTFJSONTuple's json cache could overgrow beyond its limit
- [HIVE-16004] - OutOfMemory in SparkReduceRecordHandler with vectorization mode
- [HIVE-15297] - Hive should not split semicolon within quoted string literals
- [HIVE-16538] - TestExecDriver fails if run after TestOperators#testScriptOperator
- [HIVE-15761] - ObjectStore.getNextNotification could return an empty NotificationEventResponse causing TProtocolException
- [HIVE-16047] - Shouldn't try to get KeyProvider unless encryption is enabled
- [HIVE-11428] - Performance: Struct IN() clauses are extremely slow (~10x slower)
- [HIVE-11842] - Improve RuleRegExp by caching some internal data structures
- [HIVE-11141] - Improve RuleRegExp when the Expression node stack gets huge
- [HIVE-14774] - Canceling query using Ctrl-C in beeline might lead to stale locks
- [HIVE-16422] - Should kill running Spark Jobs when a query is cancelled.
- [HIVE-16394] - HoS does not support queue name change in middle of session
- [HIVE-9679] - Remove redundant null-checks from DbNotificationListener.
- [HIVE-16057] - SchemaTool ignores --passWord argument if hadoop.security.credential.provider.path is configured
- [HIVE-16196] - UDFJson having thread-safety issues
- [HIVE-9954] - UDFJson uses the == operator to compare Strings
- [HIVE-16156] - FileSinkOperator should delete existing output target when renaming
- [HIVE-14210] - ExecDriver should call jobclient.close() to trigger cleanup
- [HIVE-16024] - MSCK Repair Requires nonstrict hive.mapred.mode
- [HIVE-16192] - QTestUtil doesn't correctly set the last command when running "test" commands
- [HIVE-16175] - Possible race condition in InstanceCache
- [HIVE-15282] - Different modification times are used when an index is built and when its staleness is checked
- [HIVE-12768] - Thread safety: binary sortable serde decimal deserialization
- [HIVE-14819] - FunctionInfo for permanent functions shows TEMPORARY FunctionType
- [HIVE-15952] - Add blobstore integration test for CREATE LIKE
- [HIVE-14964] - Failing Test: Fix TestBeelineArgParsing tests
- [HIVE-15997] - Resource leaks when query is cancelled
- [HIVE-13705] - Insert into table removes existing data
- [HIVE-12179] - Add option to not add spark-assembly.jar to Hive classpath
- [HIVE-16086] - Fix HiveMetaStoreChecker.checkPartitionDirsSingleThreaded method
- [HIVE-16019] - Query fails when group by/order by on same column with uppercase name
- [HIVE-15803] - msck can hang when nested partitions are present
- [HIVE-13864] - Beeline ignores the command that follows a semicolon and comment
- [HIVE-15782] - query on parquet table returns incorrect result when hive.optimize.index.filter is set to true
- [HIVE-15766] - DBNotificationlistener leaks JDOPersistenceManager
- [HIVE-12349] - NPE in ORC SARG for IS NULL queries on Timestamp and Date columns
- [HIVE-11147] - MetaTool doesn't update FS root location for partitions with space in name
- [HIVE-15856] - Hive export/import (hive.exim.uri.scheme.whitelist) to support s3a
- [HIVE-13463] - Fix ImportSemanticAnalyzer to allow for different src/dst filesystems
- [HIVE-15872] - The PERCENTILE_APPROX UDAF does not work with empty set
- [HIVE-13676] - Tests failing because metastore doesn't come up
- [HIVE-14054] - TestHiveMetaStoreChecker fails on master
- [HIVE-15346] - "values temp table" should not be an input source for the query
- [HIVE-7723] - Explain plan for complex query with lots of partitions is slow due to in-efficient collection used to find a matching ReadEntity
- [HIVE-15754] - exchange partition is not generating notifications
- [HIVE-15517] - NOT (x <=> y) returns NULL if x or y is NULL
- [HIVE-15485] - Investigate the DoAs failure in HoS
- [HIVE-15065] - SimpleFetchOptimizer should decide based on metastore stats when available
- [HIVE-15778] - DROP INDEX (non-existent) throws NPE when using DbNotificationListener
- [HIVE-15735] - In some cases, view objects inside a view do not have parents
- [HIVE-15137] - metastore add partitions background thread should use current username
- [HIVE-15679] - Some metastore metrics always contains column=null (write_column_statistics, write_partition_column_statistics)
- [HIVE-15338] - Wrong result from non-vectorized DATEDIFF with scalar parameter of type DATE/TIMESTAMP
- [HIVE-9869] - Trunk doesn't build with hadoop-1
- [HIVE-14693] - Some paritions will be left out when partition number is the multiple of the option hive.msck.repair.batch.size
- [HIVE-14764] - Enabling "hive.metastore.metrics.enabled" throws OOM in HiveMetastore
- [HIVE-13510] - Dynamic partitioning doesnt work when remote metastore is used
- [HIVE-13149] - Remove some unnecessary HMS connections from HS2
- [HIVE-11376] - CombineHiveInputFormat is falling back to HiveInputFormat in case codecs are found for one of the input files
- [HIVE-14820] - RPC server for spark inside HS2 is not getting server address properly
- [HIVE-13240] - GroupByOperator: Drop the hash aggregates when closing operator
- [HIVE-11293] - HiveConnection.setAutoCommit(true) throws exception
- [HIVE-10655] - [PTest2] Propagate additionalProfiles flag to the source-prep.vm
- [HIVE-12261] - schematool version info exit status should depend on compatibility, not equality
- [HIVE-15060] - Remove the autoCommit warning from beeline
- [HIVE-15551] - memory leak in directsql for mysql+bonecp specific initialization
- [HIVE-15530] - Optimize the column stats update logic in table alteration
- [HIVE-13051] - Deadline class has numerous issues
- [HIVE-12789] - Fix output twice in the history command of Beeline
- [HIVE-11594] - Analyze Table For Columns cannot handle columns with embedded spaces
- [HIVE-12465] - Hive might produce wrong results when (outer) joins are merged
- [HIVE-15367] - CTAS with LOCATION should write temp data under location directory rather than database location
- [HIVE-11849] - NPE in HiveHBaseTableShapshotInputFormat in query with just count(*)
- [HIVE-9481] - allow column list specification in INSERT statement
- [HIVE-14960] - Improve the stability of TestNotificationListener
- [HIVE-14873] - Add UDF for extraction of 'day of week'
- [HIVE-13866] - flatten callstack for directSQL errors
- [HIVE-12976] - MetaStoreDirectSql doesn't batch IN lists in all cases
- [HIVE-13895] - HoS start-up overhead in yarn-client mode
- [HIVE-14924] - MSCK REPAIR table with single threaded is throwing null pointer exception
- [HIVE-15385] - Failure to inherit permissions when running HdfsUtils.setFullFileStatus(..., false) causes queries to fail
- [HIVE-15359] - skip.footer.line.count doesnt work properly for certain situations
- [HIVE-15361] - INSERT dynamic partition on S3 fails with a MoveTask failure
- [HIVE-15355] - Concurrency issues during parallel moveFile due to HDFSUtils.setFullFileStatus
- [HIVE-13539] - HiveHFileOutputFormat searching the wrong directory for HFiles
- [HIVE-15291] - Comparison of timestamp fails if only date part is provided.
- [HIVE-13911] - load inpath fails throwing org.apache.hadoop.security.AccessControlException
- [HIVE-13904] - Ignore case when retrieving ColumnInfo from RowResolver
- [HIVE-13966] - DbNotificationListener: can loose DDL operation notifications
- [HIVE-14175] - Fix creating buckets without scheme information
- [HIVE-11920] - ADD JAR failing with URL schemes other than file/ivy/hdfs
- [HIVE-15280] - Hive.mvFile() misses the "." char when joining the filename + extension
- [HIVE-15199] - INSERT INTO data on S3 is replacing the old rows with the new ones
- [HIVE-10576] - add jar command does not work with Windows OS
- [HIVE-15231] - query on view with CTE and alias fails with table not found error
- [HIVE-14301] - insert overwrite fails for nonpartitioned tables in s3
- [HIVE-12619] - (Parquet) Switching the field order within an array of structs causes the query to fail
- [HIVE-13129] - CliService leaks HMS connection
- [HIVE-14205] - Hive doesn't support union type with AVRO file format
- [HIVE-12891] - Hive fails when java.io.tmpdir is set to a relative location
- [HIVE-15054] - Hive insertion query execution fails on Hive on Spark
- [HIVE-15090] - Temporary DB failure can stop ExpiredTokenRemover thread
- [HIVE-12757] - Fix TestCodahaleMetrics#testFileReporting
- [HIVE-13960] - Session timeout may happen before HIVE_SERVER2_IDLE_SESSION_TIMEOUT for back-to-back synchronous operations.
- [HIVE-14173] - NPE was thrown after enabling directsql in the middle of session
- [HIVE-15061] - Metastore types are sometimes case sensitive
- [HIVE-10384] - RetryingMetaStoreClient does not retry wrapped TTransportExceptions
- [HIVE-14421] - FS.deleteOnExit holds references to _tmp_space.db files
- [HIVE-14313] - Test failure TestMetaStoreMetrics.testConnections
- [HIVE-14839] - Improve the stability of TestSessionManagerMetrics
- [HIVE-13936] - Add streaming support for row_number
- [HIVE-15022] - Missing hs2-connection-timed-out in BeeLine.properties
- [HIVE-9423] - HiveServer2: Provide the user with different error messages depending on the Thrift client exception code
- [HIVE-12077] - MSCK Repair table should fix partitions in batches
- [HIVE-14822] - Add support for credential provider for jobs launched from Hiveserver2
- [HIVE-14799] - Query operation are not thread safe during its cancellation
- [HIVE-13997] - Insert overwrite directory doesn't overwrite existing files
- [HIVE-14889] - Beeline leaks sensitive environment variables of HiveServer2 when you type set;
- [HIVE-13237] - Select parquet struct field with upper case throws NPE
- [HIVE-14100] - Adding a new logged_in_user() UDF which returns the user provided when connecting
- [HIVE-14784] - Operation logs are disabled automatically if the parent directory does not exist.
- [HIVE-14588] - Add S3 credentials to the hidden configuration variable supported on HIVE-14207
- [HIVE-13381] - Timestamp & date should have precedence in type hierarchy than string group
- [HIVE-13625] - Hive Prepared Statement when executed with escape characters in parameter fails
- [HIVE-14805] - Subquery inside a view will have the object in the subquery as the direct input
- [HIVE-14011] - MessageFactory is not pluggable
- [HIVE-14426] - Extensive logging on info level in WebHCat
- [HIVE-12475] - Parquet schema evolution within array<struct<>> doesn't work
- [HIVE-10965] - direct SQL for stats fails in 0-column case
- [HIVE-13645] - Beeline needs null-guard around hiveVars and hiveConfVars read
- [HIVE-14743] - ArrayIndexOutOfBoundsException - HBASE-backed views' query with JOINs
- [HIVE-9570] - Investigate test failure on union_view.q [Spark Branch]
- [HIVE-14715] - Hive throws NumberFormatException with query with Null value
- [HIVE-11901] - StorageBasedAuthorizationProvider requires write permission on table for SELECT statements
- [HIVE-14697] - Can not access kerberized HS2 Web UI
- [HIVE-14538] - beeline throws exceptions with parsing hive config when using !sh statement
- [HIVE-14705] - Hive outer queries is not picking up the right column from subqueries
- [HIVE-14614] - Insert overwrite local directory fails with IllegalStateException
- [HIVE-10728] - deprecate unix_timestamp(void) and make it deterministic
- [HIVE-14360] - Starting BeeLine after using !save, there is an error logged: "Error setting configuration: conf"
- [HIVE-14436] - Hive 1.2.1/Hitting "ql.Driver: FAILED: IllegalArgumentException Error: , expected at the end of 'decimal(9'" after enabling hive.optimize.skewjoin and with MR engine
- [HIVE-12646] - beeline and HIVE CLI do not parse ; in quote properly
- [HIVE-12556] - Ctrl-C in beeline doesn't kill Tez query on HS2
- [HIVE-14207] - Strip HiveConf hidden params in webui conf
- [HIVE-14519] - Multi insert query bug
- [HIVE-14457] - Partitions in encryption zone are still trashed though an exception is returned
- [HIVE-14342] - Beeline output is garbled when executed from a remote shell
- [HIVE-14513] - Enhance custom query feature in LDAP atn to support resultset of ldap groups
- [HIVE-11696] - Exception when table-level serde is Parquet while partition-level serde is JSON
- [HIVE-13090] - Hive metastore crashes on NPE with ZooKeeperTokenStore
- [HIVE-14294] - HiveSchemaConverter for Parquet doesn't translate TINYINT and SMALLINT into proper Parquet types
- [HIVE-10722] - external table creation with msck in Hive can create unusable partition
- [HIVE-14359] - Hive on Spark might fail in HS2 with LDAP authentication in a kerberized cluster
- [HIVE-14151] - Use of USE_DEPRECATED_CLI environment variable does not work
- [HIVE-11943] - Set old CLI as the default Client when using hive script
- [HIVE-13043] - Reload function has no impact to function registry
- [HIVE-12259] - Command containing semicolon is broken in Beeline
- [HIVE-14296] - Session count is not decremented when HS2 clients do not shutdown cleanly.
- [HIVE-11778] - Merge beeline-cli branch to trunk
- [HIVE-11796] - CLI option is not updated when executing the initial files[beeline-cli]
- [HIVE-14153] - Beeline: beeline history doesn't work on Hive2
- [HIVE-12834] - Fix to accept the arrow keys in BeeLine CLI
- [HIVE-11746] - Connect command should not to be allowed from user[beeline-cli branch]
- [HIVE-11717] - nohup mode is not support for new hive cli
- [HIVE-14298] - NPE could be thrown in HMS when an ExpressionTree could not be made from a filter
- [HIVE-11640] - Shell command doesn't work for new CLI[Beeline-cli branch]
- [HIVE-14229] - the jars in hive.aux.jar.paths are not added to session classpath
- [HIVE-12246] - Orc FileDump fails with Missing CLI jar
- [HIVE-10624] - Update the initial script to make beeline bucked cli as default and allow user choose old hive cli by env
- [HIVE-11375] - Broken processing of queries containing NOT (x IS NOT NULL and x <> 0)
- [HIVE-11028] - Tez: table self join and join with another table fails with IndexOutOfBoundsException
- [HIVE-11637] - Support hive.cli.print.current.db in new CLI[beeline-cli branch]
- [HIVE-14267] - HS2 open_operations metrics not decremented when an operation gets timed out
- [HIVE-11624] - Beeline-cli: support hive.cli.print.header in new CLI[beeline-cli branch]
- [HIVE-11336] - Support initial file option for new CLI [beeline-cli branch]
- [HIVE-11352] - Avoid the double connections with 'e' option[beeline-cli branch]
- [HIVE-11280] - Support executing script file from hdfs in new CLI [Beeline-CLI branch]
- [HIVE-14135] - beeline output not formatted correctly for large column widths
- [HIVE-14187] - JDOPersistenceManager objects remain cached if MetaStoreClient#close is not called
- [HIVE-13749] - Memory leak in Hive Metastore
- [HIVE-11236] - BeeLine-Cli: use the same output format as old CLI in the new CLI
- [HIVE-11226] - BeeLine-Cli: support hive.cli.prompt in new CLI
- [HIVE-10904] - Use beeline-log4j.properties for migrated CLI [beeline-cli Branch]
- [HIVE-10824] - Need to update start script changes in .cmd files
- [HIVE-12590] - Repeated UDAFs with literals can produce incorrect result
- [HIVE-14209] - Add some logging info for session and operation management
- [HIVE-9013] - Hive set command exposes metastore db password
- [HIVE-14226] - Invalid check on an ASTNode#toStringTree in CalcitePlanner
- [HIVE-10190] - CBO: AST mode checks for TABLESAMPLE with AST.toString().contains("TOK_TABLESPLITSAMPLE")
- [HIVE-14137] - Hive on Spark throws FileAlreadyExistsException for jobs with multiple empty tables
- [HIVE-14195] - HiveMetaStoreClient getFunction() does not throw NoSuchObjectException
- [HIVE-14215] - Displaying inconsistent CPU usage data with MR execution engine
- [HIVE-14074] - RELOAD FUNCTION should update dropped functions
- [HIVE-12215] - Exchange partition does not show outputs field for post/pre execute hooks
- [HIVE-13704] - Don't call DistCp.execute() instead of DistCp.run()
- [HIVE-13862] - org.apache.hadoop.hive.metastore.MetaStoreDirectSql.getNumPartitionsViaSqlFilter falls back to ORM
- [HIVE-13372] - Hive Macro overwritten when multiple macros are used in one column
- [HIVE-12277] - Hive macro results on macro_duplicate.q different after adding ORDER BY
- [HIVE-11432] - Hive macro give same result for different arguments
- [HIVE-12635] - Hive should return the latest hbase cell timestamp as the row timestamp value
- [HIVE-14090] - JDOExceptions thrown by the Metastore have their full stack trace returned to clients
- [HIVE-14055] - directSql - getting the number of partitions is broken
- [HIVE-14149] - Joda Time causes an AmazonS3Exception on Hadoop3.0.0
- [HIVE-14013] - Describe table doesn't show unicode properly
- [HIVE-11990] - Loading data inpath from a temporary table dir fails on Windows
- [HIVE-11827] - STORED AS AVRO fails SELECT COUNT(*) when empty
- [HIVE-11490] - Lazily call ASTNode::toStringTree() after tree modification
- [HIVE-11316] - Use datastructure that doesnt duplicate any part of string for ASTNode::toStringTree()
- [HIVE-11243] - Changing log level in Utilities.getBaseWork
- [HIVE-7443] - Fix HiveConnection to communicate with Kerberized Hive JDBC server and alternative JDKs
- [HIVE-14098] - Logging task properties, and environment variables might contain passwords
- [HIVE-13991] - Union All on view fail with no valid permission on underneath table
- [HIVE-12661] - StatsSetupConst.COLUMN_STATS_ACCURATE is not used correctly
- [HIVE-13151] - Clean up UGI objects in FileSystem cache for transactions
- [HIVE-12785] - View with union type and UDF to `cast` the struct is broken
- [HIVE-14062] - Changes from HIVE-13502 overwritten by HIVE-13566
- [HIVE-11747] - Unnecessary error log is shown when executing a "INSERT OVERWRITE LOCAL DIRECTORY" cmd in the embedded mode
- [HIVE-13590] - Kerberized HS2 with LDAP auth enabled fails in multi-domain LDAP case
- [HIVE-14015] - SMB MapJoin failed for Hive on Spark when kerberized
- [HIVE-13093] - hive metastore does not exit on start failure
- [HIVE-10674] - jars should not be checked in to the source control repo
- [HIVE-14006] - Hive query with UNION ALL fails with ArrayIndexOutOfBoundsException
- [HIVE-13987] - Clarify current error shown when HS2 is down
- [HIVE-12018] - beeline --help doesn't return to original prompt
- [HIVE-13932] - Hive SMB Map Join with small set of LIMIT failed with NPE
- [HIVE-13953] - Issues in HiveLockObject equals method
- [HIVE-13462] - HiveResultSetMetaData.getPrecision() fails for NULL columns
- [HIVE-13836] - DbNotifications giving an error = Invalid state. Transaction has already started
- [HIVE-12742] - NULL table comparison within CASE does not work as previous hive versions
- [HIVE-9499] - hive.limit.query.max.table.partition makes queries fail on non-partitioned tables
- [HIVE-10925] - Non-static threadlocals in metastore code can potentially cause memory leak
- [HIVE-12650] - Improve error messages for Hive on Spark in case the cluster has no resources available
- [HIVE-13285] - Orc concatenation may drop old files from moving to final path
- [HIVE-13657] - Spark driver stderr logs should appear in hive client logs
- [HIVE-11031] - ORC concatenation of old files can fail while merging column statistics
- [HIVE-10685] - Alter table concatenate oparetor will cause duplicate data
- [HIVE-13261] - Can not compute column stats for partition when schema evolves
- [HIVE-10313] - Literal Decimal ExprNodeConstantDesc should contain value of HiveDecimal instead of String
- [HIVE-12344] - Wrong types inferred for SemiJoin generation in CBO
- [HIVE-12568] - Provide an option to specify network interface used by Spark remote client [Spark Branch]
- [HIVE-12937] - DbNotificationListener unable to clean up old notification events
- [HIVE-13632] - Hive failing on insert empty array into parquet table
- [HIVE-13115] - MetaStore Direct SQL getPartitions call fail when the columns schemas for a partition are null
- [HIVE-13302] - direct SQL: cast to date doesn't work on Oracle
- [HIVE-13585] - Add counter metric for direct sql failures
- [HIVE-10934] - Restore support for DROP PARTITION PURGE
- [HIVE-13500] - Launching big queries fails with Out of Memory Exception
- [HIVE-9826] - Firing insert event fails on temporary table
- [HIVE-12311] - explain CTAS fails if the table already exists
- [HIVE-12517] - Beeline's use of failed connection(s) causes failures and leaks.
- [HIVE-7193] - Hive should support additional LDAP authentication parameters
- [HIVE-11369] - Mapjoins in HiveServer2 fail when jmxremote is used
- [HIVE-9609] - AddPartitionMessage.getPartitions() can return null
- [HIVE-13570] - Some queries with Union all fail when CBO is off
- [HIVE-13527] - Using deprecated APIs in HBase client causes zookeeper connection leaks.
- [HIVE-6099] - Multi insert does not work properly with distinct count
- [HIVE-11408] - HiveServer2 is leaking ClassLoaders when add jar / temporary functions are used due to constructor caching in Hadoop ReflectionUtils
- [HIVE-13530] - Hive on Spark throws Kryo exception in some cases
- [HIVE-8165] - Annotation changes for replication
- [HIVE-10426] - Rework/simplify ReplicationTaskFactory instantiation
- [HIVE-10227] - Concrete implementation of Export/Import based ReplicationTaskFactory
- [HIVE-10228] - Changes to Hive Export/Import/DropTable/DropPartition to support replication semantics
- [HIVE-10074] - Ability to run HCat Client Unit tests in a system test setting
- [HIVE-8164] - Adding in a ReplicationTask that converts a Notification Event to actionable tasks
- [HIVE-9550] - ObjectStore.getNextNotification() can return events inside NotificationEventResponse as null which conflicts with its thrift "required" tag
- [HIVE-9273] - Add option to fire metastore event on insert
- [HIVE-7079] - Hive logs errors about missing tables when parsing CTE expressions
- [HIVE-9271] - Add ability for client to request metastore to fire an event
- [HIVE-13410] - PerfLog metrics scopes not closed if there are exceptions on HS2
- [HIVE-13401] - Kerberized HS2 with LDAP auth enabled fails kerberos/delegation token authentication
- [HIVE-12706] - Incorrect output from from_utc_timestamp()/to_utc_timestamp when local timezone has DST
- [HIVE-12506] - SHOW CREATE TABLE command creates a table that does not work for RCFile format
- [HIVE-11590] - AvroDeserializer is very chatty
- [HIVE-13286] - Query ID is being reused across queries
- [HIVE-13300] - Hive on spark throws exception for multi-insert with join
- [HIVE-13217] - Replication for HoS mapjoin small file needs to respect dfs.replication.max
- [HIVE-12616] - NullPointerException when spark session is reused to run a mapjoin
- [HIVE-9471] - Bad seek in uncompressed ORC, at row-group boundary.
- [HIVE-11054] - Read error : Partition Varchar column cannot be cast to string
- [HIVE-11919] - Hive Union Type Mismatch
- [HIVE-13243] - Hive drop table on encyption zone fails for external tables
- [HIVE-12684] - NPE in stats annotation when all values in decimal column are NULLs
- [HIVE-13251] - hive can't read the decimal in AVRO file generated from previous version
- [HIVE-11097] - HiveInputFormat uses String.startsWith to compare splitPath and PathToAliases
- [HIVE-11319] - CTAS with location qualifier overwrites directories
- [HIVE-13200] - Aggregation functions returning empty rows on partitioned columns
- [HIVE-11285] - ObjectInspector for partition columns in FetchOperator in SMBJoin causes exception
- [HIVE-12941] - Unexpected result when using MIN() on struct with NULL in first field
- [HIVE-13160] - HS2 unable to load UDFs on startup when HMS is not ready
- [HIVE-9486] - Use session classloader instead of application loader
- [HIVE-13082] - Enable constant propagation optimization in query with left semi join
- [HIVE-11201] - HCatalog is ignoring user specified avro schema in the table definition
- [HIVE-12084] - Hive queries with ORDER BY and large LIMIT fails with OutOfMemoryError Java heap space
- [HIVE-11768] - java.io.DeleteOnExitHook leaks memory on long running Hive Server2 Instances
- [HIVE-12156] - expanding view doesn't quote reserved keyword
- [HIVE-13099] - Non-SQLOperations lead to Web UI NPE
- [HIVE-13057] - Remove duplicate copies of TableDesc property values in PartitionDesc
- [HIVE-13065] - Hive throws NPE when writing map type data to a HBase backed table
- [HIVE-10231] - Compute partition column stats fails if partition col type is date
- [HIVE-13039] - BETWEEN predicate is not functioning correctly with predicate pushdown on Parquet table
- [HIVE-12207] - Query fails when non-ascii characters are used in string literals
- [HIVE-10698] - query on view results fails with table not found error if view is created with subquery alias (CTE).
- [HIVE-11866] - Add framework to enable testing using LDAPServer using LDAP protocol
- [HIVE-13047] - Disabling Web UI leads to NullPointerException
- [HIVE-13045] - move guava dependency back to 14 after HIVE-12952
- [HIVE-12951] - Reduce Spark executor prewarm timeout to 5s
- [HIVE-12315] - vectorization_short_regress.q has a wrong result issue for a double calculation
- [HIVE-12885] - LDAP Authenticator improvements
- [HIVE-11668] - make sure directsql calls pre-query init when needed
- [HIVE-11123] - Fix how to confirm the RDBMS product name at Metastore.
- [HIVE-11023] - Disable directSQL if datanucleus.identifierFactory = datanucleus2
- [HIVE-11898] - support default partition in metastoredirectsql
- [HIVE-12966] - Change some ZooKeeperHiveLockManager logs to debug
- [HIVE-12790] - Metastore connection leaks in HiveServer2
- [HIVE-10213] - MapReduce jobs using dynamic-partitioning fail on commit.
- [HIVE-11828] - beeline -f fails on scripts with tabs between column type and comment
- [HIVE-5672] - Insert with custom separator not supported for non-local directory
- [HIVE-11135] - Fix the Beeline set and save command in order to avoid the NullPointerException
- [HIVE-11191] - Beeline-cli: support hive.cli.errors.ignore in new CLI
- [HIVE-10821] - Beeline-CLI: Implement CLI source command using Beeline functionality
- [HIVE-10623] - Implement hive cli options using beeline functionality
- [HIVE-12909] - Some encryption q-tests fail because trash is disabled in encryption_with_trash.q
- [HIVE-9743] - Incorrect result set for vectorized left outer join
- [HIVE-12312] - Excessive logging in PPD code
- [HIVE-12782] - update the golden files for some tests that fail
- [HIVE-12538] - After set spark related config, SparkSession never get reused
- [HIVE-12045] - ClassNotFoundException for GenericUDF [Spark Branch]
- [HIVE-12788] - Setting hive.optimize.union.remove to TRUE will break UNION ALL with aggregate functions
- [HIVE-12330] - Fix precommit Spark test part2
- [HIVE-12282] - beeline - update command printing in verbose mode
- [HIVE-11977] - Hive should handle an external avro table with zero length files present
- [HIVE-12795] - Vectorized execution causes ClassCastException
- [HIVE-12784] - Group by SemanticException: Invalid column reference
- [HIVE-12708] - Hive on Spark doesn't work with Kerboresed HBase [Spark Branch]
- [HIVE-9507] - Make "LATERAL VIEW inline(expression) mytable" tolerant to nulls
- [HIVE-10514] - Fix MiniCliDriver tests failure
- [HIVE-12505] - Insert overwrite in same encrypted zone silently fails to remove some existing files
- [HIVE-12566] - Incorrect result returns when using COALESCE in WHERE condition with LEFT JOIN
- [HIVE-11288] - Avro SerDe InstanceCache returns incorrect schema
- [HIVE-12466] - SparkCounter not initialized error
- [HIVE-10178] - DateWritable incorrectly calculates daysSinceEpoch for negative Unix time
- [HIVE-12512] - Include driver logs in execution-level Operation logs
- [HIVE-12008] - Hive queries failing when using count(*) on column in view
- [HIVE-12469] - Bump Commons-Collections dependency from 3.2.1 to 3.2.2. to address vulnerability
- [HIVE-12196] - NPE when converting bad timestamp value
- [HIVE-7653] - Hive AvroSerDe does not support circular references in Schema
- [HIVE-10697] - ObjectInspectorConvertors#UnionConvertor does a faulty conversion
- [HIVE-10687] - AvroDeserializer fails to deserialize evolved union fields
- [HIVE-12304] - "drop database cascade" needs to unregister functions
- [HIVE-11143] - Tests udf_from_utc_timestamp.q/udf_to_utc_timestamp.q do not work with updated Java timezone information
- [HIVE-12396] - BucketingSortingReduceSinkOptimizer may still throw IOB exception for duplicate columns
- [HIVE-9617] - UDF from_utc_timestamp throws NPE if the second argument is null
- [HIVE-12332] - BucketingSortingReduceSinkOptimizer throws IOB exception for duplicate columns
- [HIVE-12354] - MapJoin with double keys is slow on MR
- [HIVE-12456] - QueryId can't be stored in the configuration of the SessionState since multiple queries can run in a single session
- [HIVE-7575] - GetTables thrift call is very slow
- [HIVE-12229] - Custom script in query cannot be executed in yarn-cluster mode [Spark Branch].
- [HIVE-11825] - get_json_object(col,'$.a') is null in where clause didn`t work
- [HIVE-12418] - HiveHBaseTableInputFormat.getRecordReader() causes Zookeeper connection leak.
- [HIVE-12388] - GetTables cannot get external tables when TABLE type argument is given
- [HIVE-12378] - Exception on HBaseSerDe.serialize binary field
- [HIVE-11616] - DelegationTokenSecretManager reuses the same objectstore, which has concurrency issues
- [HIVE-12365] - Added resource path is sent to cluster as an empty string when externally removed
- [HIVE-11583] - When PTF is used over a large partitions result could be corrupted
- [HIVE-10895] - ObjectStore does not close Query objects in some calls, causing a potential leak in some metastore db resources
- [HIVE-12287] - Lineage for lateral view shows wrong dependencies
- [HIVE-6091] - Empty pipeout files are created for connection create/close
- [HIVE-11915] - BoneCP returns closed connections from the pool
- [HIVE-12280] - HiveConnection does not try other HS2 after failure for service discovery
- [HIVE-11932] - JDBC Driver appends an extra "/" when configuring connection by reading httpPath from ZooKeeper
- [HIVE-11581] - HiveServer2 should store connection params in ZK when using dynamic service discovery for simpler client connection string.
- [HIVE-8890] - HiveServer2 dynamic service discovery: use persistent ephemeral nodes curator recipe
- [HIVE-10265] - Hive CLI crashes on != inequality
- [HIVE-12278] - Skip logging lineage for explain queries
- [HIVE-12268] - Context leaks deleteOnExit paths
- [HIVE-12189] - The list in pushdownPreds of ppd.ExprWalkerInfo should not be allowed to grow very large
- [HIVE-12250] - Zookeeper connection leaks in Hive's HBaseHandler.
- [HIVE-11710] - Beeline embedded mode doesn't output query progress after setting any session property
- [HIVE-11579] - Invoke the set command will close standard error output[beeline-cli]
- [HIVE-12218] - Unable to create a like table for an hbase backed table
- [HIVE-12225] - LineageCtx should release all resources at clear
- [HIVE-11149] - Fix issue with sometimes HashMap in PerfLogger.java hangs
- [HIVE-12058] - Change hive script to record errors when calling hbase fails
- [HIVE-12200] - INSERT INTO table using a select statement w/o a FROM clause fails
- [HIVE-12188] - DoAs does not work properly in non-kerberos secured HS2
- [HIVE-12082] - Null comparison for greatest and least operator
- [HIVE-12046] - Re-create spark client if connection is dropped
- [HIVE-11493] - Predicate with integer column equals double evaluates to false
- [HIVE-11826] - 'hadoop.proxyuser.hive.groups' configuration doesn't prevent unauthorized user to access metastore
- [HIVE-11802] - Float-point numbers are displayed with different precision in Beeline/JDBC
- [HIVE-10122] - Hive metastore filter-by-expression is broken for non-partition expressions
- [HIVE-12007] - Hive LDAP Authenticator should allow just Domain without baseDN (for AD)
- [HIVE-12035] - branch-1 build broken
- [HIVE-10083] - SMBJoin fails in case one table is uninitialized
- [HIVE-11835] - Type decimal(1,1) reads 0.0, 0.00, etc from text file as NULL
- [HIVE-8398] - ExprNodeColumnDesc cannot be cast to ExprNodeConstantDesc
- [HIVE-11982] - Some test cases for union all fail with recent changes
- [HIVE-11995] - Remove repetitively setting permissions in insert/load overwrite partition
- [HIVE-11926] - Stats annotation might not extract stats for varchar/decimal columns
- [HIVE-11745] - Alter table Exchange partition with multiple partition_spec is not working
- [HIVE-11132] - Queries using join and group by produce incorrect output when hive.auto.convert.join=false and hive.optimize.reducededuplication=true
- [HIVE-9644] - Fold case & when udfs
- [HIVE-9645] - Constant folding case NULL equality
- [HIVE-9587] - UDF decode should accept STRING_GROUP types for the second parameter
- [HIVE-9754] - rename GenericUDFLevenstein to GenericUDFLevenshtein
- [HIVE-11940] - "INSERT OVERWRITE" query is very slow because it creates one "distcp" per file to copy data from staging directory to target directory
- [HIVE-11939] - TxnDbUtil should turn off jdbc auto commit
- [HIVE-8529] - HiveSessionImpl#fetchResults should not try to fetch operation log when hive.server2.logging.operation.enabled is false.
- [HIVE-10665] - Continue to make udaf_percentile_approx_23.q test more stable
- [HIVE-10059] - Make udaf_percentile_approx_23.q test more stable
- [HIVE-10788] - Change sort_array to support non-primitive types
- [HIVE-9468] - Test groupby3_map_skew.q fails due to decimal precision difference
- [HIVE-10980] - Merge of dynamic partitions loads all data to default partition
- [HIVE-11820] - export tables with size of >32MB throws "java.lang.IllegalArgumentException: Skip CRC is valid only with update options"
- [HIVE-11607] - Export tables broken for data > 32 MB
- [HIVE-11695] - If user have no permission to create LOCAL DIRECTORY the Hql does not throw any exception and fail silently.
- [HIVE-11824] - Insert to local directory causes staging directory to be copied
- [HIVE-11839] - Vectorization wrong results with filter of (CAST AS CHAR)
- [HIVE-11834] - Lineage doesn't work with dynamic partitioning query
- [HIVE-11077] - Add support in parser and wire up to txn manager
- [HIVE-11843] - Add 'sort by c' to Parquet PPD q-tests to avoid different output issues with hadoop-1
- [HIVE-11817] - Window function max NullPointerException
- [HIVE-11771] - Parquet timestamp conversion errors
- [HIVE-5277] - HBase handler skips rows with null valued first cells when only row key is selected
- [HIVE-9984] - JoinReorder's getOutputSize is exponential
- [HIVE-9908] - vectorization error binary type not supported, group by with binary columns
- [HIVE-10465] - whitelist restrictions don't get initialized in new copy of HiveConf
- [HIVE-9086] - Add language support to PURGE data while dropping partitions.
- [HIVE-9083] - New metastore API to support to purge partition-data directly in dropPartitions().
- [HIVE-7049] - Unable to deserialize AVRO data when file schema and record schema are different and nullable
- [HIVE-11737] - IndexOutOfBounds compiling query with duplicated groupby keys
- [HIVE-11618] - Correct the SARG api to reunify the PredicateLeaf.Type INTEGER and LONG
- [HIVE-10021] - "Alter index rebuild" statements submitted through HiveServer2 fail when Sentry is enabled
- [HIVE-10421] - DROP TABLE with qualified table name ignores database name when checking partitions
- [HIVE-10451] - PTF deserializer fails if values are not used in reducer
- [HIVE-11712] - Duplicate groupby keys cause ClassCastException
- [HIVE-4239] - Remove lock on compilation stage
- [HIVE-11604] - HIVE return wrong results in some queries with PTF function
- [HIVE-5545] - HCatRecord getInteger method returns String when used on Partition columns of type INT
- [HIVE-9580] - Server returns incorrect result from JOIN ON VARCHAR columns
- [HIVE-10140] - Window boundary is not compared correctly
- [HIVE-11502] - Map side aggregation is extremely slow
- [HIVE-11586] - ObjectInspectorFactory.getReflectionObjectInspector is not thread-safe
- [HIVE-10719] - Hive metastore failure when alter table rename is attempted.
- [HIVE-10085] - Lateral view on top of a view throws RuntimeException
- [HIVE-10288] - Cannot call permanent UDFs
- [HIVE-11549] - Hide Hive configuration from spark driver launching process
- [HIVE-11580] - ThriftUnionObjectInspector#toString throws NPE
- [HIVE-11464] - lineage info missing if there are multiple outputs
- [HIVE-10996] - Aggregation / Projection over Multi-Join Inner Query producing incorrect results
- [HIVE-9613] - Left join query plan outputs wrong column when using subquery
- [HIVE-7476] - CTAS does not work properly for s3
- [HIVE-10875] - Select query with view in subquery adds underlying table as direct input
- [HIVE-11250] - Change in spark.executor.instances (and others) doesn't take effect after RSC is launched for HS2 [Spark Brnach]
- [HIVE-11433] - NPE for a multiple inner join query
- [HIVE-10972] - DummyTxnManager always locks the current database in shared mode, which is incorrect.
- [HIVE-11258] - The function drop_database_core() of HiveMetaStore.java may not drop all the tables
- [HIVE-11172] - Vectorization wrong results for aggregate query with where clause without group by
- [HIVE-11426] - lineage3.q fails with -Phadoop-1
- [HIVE-11401] - Predicate push down does not work with Parquet when partitions are in the expression
- [HIVE-10553] - Remove hardcoded Parquet references from SearchArgumentImpl
- [HIVE-11203] - Beeline force option doesn't force execution when errors occurred in a script.
- [HIVE-11333] - ColumnPruner prunes columns of UnionOperator that should be kept
- [HIVE-11271] - java.lang.IndexOutOfBoundsException when union all with if function
- [HIVE-11255] - get_table_objects_by_name() in HiveMetaStore.java needs to retrieve table objects in multiple batches
- [HIVE-11279] - Hive should emit lineage information in json compact format
- [HIVE-11216] - UDF GenericUDFMapKeys throws NPE when a null map value is passed in
- [HIVE-11174] - Hive does not treat floating point signed zeros as equal (-0.0 should equal 0.0 according to IEEE floating point spec)
- [HIVE-11129] - Issue a warning when copied from UTF-8 to ISO 8859-1
- [HIVE-11194] - Exchange partition on external tables should fail with error message when target folder already exists
- [HIVE-10270] - Cannot use Decimal constants less than 0.1BD
- [HIVE-10801] - 'drop view' fails throwing java.lang.NullPointerException
- [HIVE-11184] - Lineage - ExprProcFactory#getExprString may throw NullPointerException
- [HIVE-11150] - Remove wrong warning message related to chgrp
- [HIVE-10572] - Improve Hive service test to check empty string
- [HIVE-9934] - Vulnerability in LdapAuthenticationProviderImpl enables HiveServer2 client to degrade the authentication mechanism to "none", allowing authentication without password
- [HIVE-10573] - Improve Hive service for connection
- [HIVE-10538] - Fix NPE in FileSinkOperator from hashcode mismatch
- [HIVE-10706] - Make vectorized_timestamp_funcs test more stable
- [HIVE-11157] - Hive.get(HiveConf) returns same Hive object to different user sessions
- [HIVE-11095] - SerDeUtils another bug ,when Text is reused
- [HIVE-10437] - NullPointerException on queries where map/reduce is not involved on tables with partitions
- [HIVE-11100] - Beeline should escape semi-colon in queries
- [HIVE-11112] - ISO-8859-1 text output has fragments of previous longer rows appended
- [HIVE-11062] - Remove Exception stacktrace from Log.info when ACL is not supported.
- [HIVE-11109] - Replication factor is not properly set in SparkHashTableSinkOperator [Spark Branch]
- [HIVE-10841] - [WHERE col is not null] does not work sometimes for queries with many JOIN statements
- [HIVE-10944] - Fix HS2 for Metrics
- [HIVE-10977] - No need to instantiate MetaStoreDirectSql when HMS DirectSql is disabled
- [HIVE-10976] - Redundant HiveMetaStore connect check in HS2 CLIService start
- [HIVE-10659] - Beeline command which contains semi-colon as a non-command terminator will fail
- [HIVE-10956] - HS2 leaks HMS connections
- [HIVE-10963] - Hive throws NPE rather than meaningful error message when window is missing
- [HIVE-9665] - Parallel move task optimization causes race condition
- [HIVE-10571] - HiveMetaStoreClient should close existing thrift connection before its reconnect
- [HIVE-10803] - document jdbc url format properly
- [HIVE-10835] - Concurrency issues in JDBC driver
- [HIVE-10808] - Inner join on Null throwing Cast Exception
- [HIVE-10771] - "separatorChar" has no effect in "CREATE TABLE AS SELECT" statement
- [HIVE-10732] - Hive JDBC driver does not close operation for metadata queries
- [HIVE-10151] - insert into A select from B is broken when both A and B are Acid tables and bucketed the same way
- [HIVE-10483] - insert overwrite partition deadlocks on itself with DbTxnManager
- [HIVE-10050] - Support overriding memory configuration for AM launched for TempletonControllerJob
- [HIVE-10242] - ACID: insert overwrite prevents create table command
- [HIVE-10481] - ACID table update finishes but values not really updated if column names are not all lower case
- [HIVE-10150] - delete from acidTbl where a in(select a from nonAcidOrcTbl) fails
- [HIVE-10721] - SparkSessionManagerImpl leaks SparkSessions [Spark Branch]
- [HIVE-10671] - yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]
- [HIVE-10453] - HS2 leaking open file descriptors when using UDFs
- [HIVE-10646] - ColumnValue does not handle NULL_TYPE
- [HIVE-10626] - Spark paln need to be updated [Spark Branch]
- [HIVE-10202] - Beeline outputs prompt+query on standard output when used in non-interactive mode
- [HIVE-10541] - Beeline requires newline at the end of each query in a file
- [HIVE-9625] - Delegation tokens for HMS are not renewed
- [HIVE-10499] - Ensure Session/ZooKeeperClient instances are closed
- [HIVE-10312] - SASL.QOP in JDBC URL is ignored for Delegation token Authentication
- [HIVE-10473] - Spark client is recreated even spark configuration is not changed
- [HIVE-10324] - Hive metatool should take table_param_key to allow for changes to avro serde's schema url key
- [HIVE-10087] - Beeline's --silent option should suppress query from being echoed when running with -f option
- [HIVE-10098] - HS2 local task for map join fails in KMS encrypted cluster
- [HIVE-10093] - Unnecessary HMSHandler initialization for default MemoryTokenStore on HS2
- [HIVE-10108] - Index#getIndexTableName() returns db.index_table_name
- [HIVE-10143] - HS2 fails to clean up Spark client state on timeout [Spark Branch]
- [HIVE-10086] - Hive throws error when accessing Parquet file schema using field name match
- [HIVE-10073] - Runtime exception when querying HBase with Spark [Spark Branch]
- [HIVE-9839] - HiveServer2 leaks OperationHandle on async queries which fail at compile phase
- [HIVE-9228] - Problem with subquery using windowing functions
- [HIVE-10023] - Fix more cache related concurrency issue [Spark Branch]
- [HIVE-10009] - LazyObjectInspectorFactory is not thread safe [Spark Branch]
- [HIVE-9994] - Hive query plan returns sensitive data to external applications
- [HIVE-9993] - Retrying task could use cached bad operators [Spark Branch]
- [HIVE-9920] - DROP DATABASE IF EXISTS throws exception if database does not exist
- [HIVE-9961] - HookContext for view should return a table type of VIRTUAL_VIEW
- [HIVE-9828] - Semantic analyzer does not capture view parent entity for tables referred in view with union all
- [HIVE-9866] - Changing a column's type doesn't change column stats type in metastore
- [HIVE-9813] - Hive JDBC - DatabaseMetaData.getColumns method cannot find classes added with "add jar" command
- [HIVE-9929] - StatsUtil#getAvailableMemory could return negative value
- [HIVE-9877] - Beeline cannot run multiple statements in the same row
- [HIVE-9935] - Fix tests for java 1.8 [Spark Branch]
- [HIVE-9620] - Cannot retrieve column statistics using HMS API if column name contains uppercase characters
- [HIVE-9916] - Fix TestSparkSessionManagerImpl [Spark Branch]
- [HIVE-9855] - Runtime skew join doesn't work when skewed data only exists in big table
- [HIVE-9601] - New Beeline queries will hang If Beeline terminates in-properly [Spark Branch]
- [HIVE-9902] - Map join small table files need more replications [Spark Branch]
- [HIVE-9860] - MapredLocalTask/SecureCmdDoAs leaks local files
- [HIVE-9872] - Making yarn-cluster default for Hive on Spark [Spark Branch]
- [HIVE-9803] - SparkClientImpl should not attempt impersonation in CLI mode [Spark Branch]
- [HIVE-9831] - HiveServer2 should use ConcurrentHashMap in ThreadFactory
- [HIVE-9847] - Hive should not allow additional attemps when RSC fails [Spark Branch]
- [HIVE-9830] - Map join could dump a small table multiple times [Spark Branch]
- [HIVE-9770] - Beeline ignores --showHeader for non-tablular output formats i.e csv,tsv,dsv
- [HIVE-9755] - Hive built-in "ngram" UDAF fails when a mapper has no matches.
- [HIVE-7271] - Speed up unit tests
- [HIVE-9655] - Dynamic partition table insertion error
- [HIVE-9716] - Map job fails when table's LOCATION does not have scheme
- [HIVE-9749] - ObjectStore schema verification logic is incorrect
- [HIVE-9706] - HBase handler support for snapshots should confirm properties before use
- [HIVE-9622] - Getting NPE when trying to restart HS2 when metastore is configured to use org.apache.hadoop.hive.thrift.DBTokenStore
- [HIVE-9023] - HiveHistoryImpl relies on removed counters to print num rows
- [HIVE-9648] - Null check key provider before doing set
- [HIVE-9496] - Slf4j warning in hive command
- [HIVE-9303] - Parquet files are written with incorrect definition levels
- [HIVE-9526] - ClassCastException thrown by HiveStatement
- [HIVE-9482] - Hive parquet timestamp compatibility
- [HIVE-9501] - DbNotificationListener doesn't include dbname in create database notification and does not include tablename in create table notification
- [HIVE-9489] - add javadoc for UDFType annotation
- [HIVE-9436] - RetryingMetaStoreClient does not retry JDOExceptions
- [HIVE-9474] - truncate table changes permissions on the target
- [HIVE-9484] - ThriftCLIService#getDelegationToken does case sensitive comparison
Improvement
- [HIVE-10815] - Let HiveMetaStoreClient Choose MetaStore Randomly
- [HIVE-16912] - Improve table validator's performance against Oracle
- [HIVE-17237] - HMS wastes 26.4% of memory due to dup strings in metastore.api.Partition.parameters
- [HIVE-17132] - Add InterfaceAudience and InterfaceStability annotations for UDF APIs
- [HIVE-15068] - Run ClearDanglingScratchDir periodically inside HS2
- [HIVE-15792] - Hive should raise SemanticException when LPAD/RPAD pad character's length is 0
- [HIVE-16974] - Change the sort key for the schema tool validator to be <ID>
- [HIVE-17191] - Add InterfaceAudience and InterfaceStability annotations for StorageHandler APIs
- [HIVE-12063] - Pad Decimal numbers with trailing zeros to the scale of the column
- [HIVE-15321] - Change to read as long for HiveConf.ConfVars.METASTORESERVERMAXMESSAGESIZE
- [HIVE-11240] - Change value type from int to long for HiveConf.ConfVars.METASTORESERVERMAXMESSAGESIZE
- [HIVE-17072] - Make the parallelized timeout configurable in BeeLine tests
- [HIVE-17131] - Add InterfaceAudience and InterfaceStability annotations for SerDe APIs
- [HIVE-15473] - Progress Bar on Beeline client
- [HIVE-10130] - Merge trunk to Spark branch 03/27/2015 [Spark Branch]
- [HIVE-9152] - Dynamic Partition Pruning [Spark Branch]
- [HIVE-12274] - Increase width of columns used for general configuration in the metastore.
- [HIVE-16914] - Change HiveMetaStoreClient to AutoCloseable
- [HIVE-16729] - Improve location validator to check for blank paths.
- [HIVE-14080] - hive.metastore.schema.verification should check for schema compatiblity
- [HIVE-16771] - Schematool should use MetastoreSchemaInfo to get the metastore schema version from database
- [HIVE-14389] - Beeline should not output query and prompt to stdout
- [HIVE-10975] - Parquet: Bump the parquet version up to 1.8.1
- [HIVE-16723] - Enable configurable MetaStoreSchemaInfo
- [HIVE-11487] - Add getNumPartitionsByFilter api in metastore api
- [HIVE-16711] - Remove property_id column from metastore_db_properties table
- [HIVE-16700] - Log ZK discovery info (hostname & port) for HTTP mode when connection is established
- [HIVE-16697] - Schema table validator should return a sorted list of missing tables
- [HIVE-16647] - Improve the validation output to make the output to stderr and stdout more consistent
- [HIVE-11418] - Dropping a database in an encryption zone with CASCADE and trash enabled fails
- [HIVE-14340] - Add a new hook triggers before query compilation and after query execution
- [HIVE-16146] - If possible find a better way to filter the TestBeeLineDriver output
- [HIVE-16079] - HS2: high memory pressure due to duplicate Properties objects
- [HIVE-16449] - BeeLineDriver should handle query result sorting
- [HIVE-13780] - Allow user to update AVRO table schema via command even if table's definition was defined through schema file
- [HIVE-16345] - BeeLineDriver should be able to run qtest files which are using default database tables
- [HIVE-15616] - Improve contents of qfile test output
- [HIVE-16426] - Query cancel: improve the way to handle files
- [HIVE-16430] - Add log to show the cancelled query id when cancelOperation is called.
- [HIVE-16242] - Run BeeLine tests parallel
- [HIVE-16152] - TestBeeLineDriver logging improvements
- [HIVE-13183] - More logs in operation logs
- [HIVE-16127] - Separate database initialization from actual query run in TestBeeLineDriver
- [HIVE-14459] - TestBeeLineDriver - migration and re-enable
- [HIVE-16334] - Query lock contains the query string, which can cause OOM on ZooKeeper
- [HIVE-13884] - Disallow queries in HMS fetching more than a configured number of partitions
- [HIVE-16297] - Improving hive logging configuration variables
- [HIVE-16164] - Provide mechanism for passing HMS notification ID between transactional and non-transactional listeners.
- [HIVE-1575] - get_json_object does not support JSON array at the root level
- [HIVE-16286] - Log canceled query id
- [HIVE-16166] - HS2 may still waste up to 15% of memory on duplicate strings
- [HIVE-16205] - Improving type safety in Objectstore
- [HIVE-16119] - HiveMetaStoreChecker: remove singleThread logic duplication
- [HIVE-13782] - Compile async query asynchronously
- [HIVE-16176] - SchemaTool should exit with non-zero exit code when one or more validator's fail.
- [HIVE-15916] - Add blobstore tests for CTAS
- [HIVE-16014] - HiveMetastoreChecker should use hive.metastore.fshandler.threads instead of hive.mv.files.thread for pool size
- [HIVE-15882] - HS2 generating high memory pressure with many partitions and concurrent queries
- [HIVE-15430] - Change SchemaTool table validator to test based on the dbType
- [HIVE-15879] - Fix HiveMetaStoreChecker.checkPartitionDirs method
- [HIVE-14803] - S3: Stats gathering for insert queries can be expensive for partitioned dataset
- [HIVE-13179] - Allow custom HiveConf to be passed to Authentication Providers
- [HIVE-15895] - Use HDFS for stats collection temp dir on blob storage
- [HIVE-15550] - fix arglist logging in schematool
- [HIVE-15815] - Allow to pass some Oozie properties to Spark in HoS
- [HIVE-14423] - S3: Fetching partition sizes from FS can be expensive when stats are not available in metastore
- [HIVE-15805] - Some minor improvement on the validation tool
- [HIVE-14920] - S3: Optimize SimpleFetchOptimizer::checkThreshold()
- [HIVE-14854] - Add a core cluster type to QTestUtil
- [HIVE-14824] - Separate fstype from cluster type in QTestUtil
- [HIVE-14818] - Reduce number of retries while starting HiveServer for tests
- [HIVE-14651] - Add a local cluster for Tez and LLAP
- [HIVE-14625] - Minor qtest fixes
- [HIVE-12236] - Enable SimpleFetchOptimizer for more query types
- [HIVE-11882] - Fetch optimizer should stop source files traversal once it exceeds the hive.fetch.task.conversion.threshold
- [HIVE-13902] - [Refactor] Minimize metastore jar dependencies on task nodes
- [HIVE-13901] - Hivemetastore add partitions can be slow depending on filesystems
- [HIVE-15546] - Optimize Utilities.getInputPaths() so each listStatus of a partition is done in parallel
- [HIVE-14204] - Optimize loading dynamic partitions
- [HIVE-9138] - Add some explain to PTF operator
- [HIVE-13726] - Improve dynamic partition loading VI
- [HIVE-15572] - Improve the response time for query canceling when it happens during acquiring locks
- [HIVE-14914] - Improve the 'TestClass' did not produce a TEST-*.xml file message
- [HIVE-14540] - Add support in ptest to create batches for non qfile tests
- [HIVE-13511] - Run clidriver tests from within the qtest dir for the precommit tests
- [HIVE-12897] - Improve dynamic partition loading
- [HIVE-14887] - Reduce the memory used by MiniMr, MiniTez, MiniLlap tests
- [HIVE-15009] - ptest - avoid unnecessary cleanup from previous test runs in batch-exec.vm
- [HIVE-14835] - Improve ptest2 build time
- [HIVE-14781] - ptest killall command does not work
- [HIVE-14663] - Change ptest java language version to 1.7, other version changes and fixes
- [HIVE-14561] - Minor ptest2 improvements
- [HIVE-13763] - Update smart-apply-patch.sh with ability to use patches from git
- [HIVE-15392] - Refactoring the validate function of HiveSchemaTool to make the output consistent
- [HIVE-14511] - Improve MSCK for partitioned table to deal with special cases
- [HIVE-13507] - Improved logging for ptest
- [HIVE-13505] - Skip running TestDummy where possibe during precommit builds
- [HIVE-15410] - WebHCat supports get/set table property with its name containing period and hyphen
- [HIVE-15025] - Secure-Socket-Layer (SSL) support for HMS
- [HIVE-12222] - Define port range in property for RPCServer
- [HIVE-10641] - create CRC32 UDF
- [HIVE-15232] - Add notification events for functions and indexes
- [HIVE-15114] - Remove extra MoveTask operators from the ConditionalTask
- [HIVE-15121] - Last MR job in Hive should be able to write to a different scratch directory
- [HIVE-9664] - Hive "add jar" command should be able to download and add jars from a repository
- [HIVE-15093] - S3-to-S3 Renames: Files should be moved individually rather than at a directory level
- [HIVE-12908] - Improve dynamic partition loading III
- [HIVE-10276] - Implement date_format(timestamp, fmt) UDF
- [HIVE-10644] - create SHA2 UDF
- [HIVE-9518] - Implement MONTHS_BETWEEN aligned with Oracle one
- [HIVE-13933] - Add an option to turn off parallel file moves
- [HIVE-13716] - Improve dynamic partition loading V
- [HIVE-12988] - Improve dynamic partition loading IV
- [HIVE-14373] - Add integration tests for hive on S3
- [HIVE-11032] - Enable more tests for grouping by skewed data [Spark Branch]
- [HIVE-14444] - Upgrade qtest execution framework to junit4 - migrate most of them
- [HIVE-13429] - Tool to remove dangling scratch dir
- [HIVE-15000] - Remove addlocaldriverjar, and addlocaldrivername from command line help
- [HIVE-11538] - Add an option to skip init script while running tests
- [HIVE-14762] - Add logging while removing scratch space
- [HIVE-13058] - Add session and operation_log directory deletion messages
- [HIVE-13760] - Add a HIVE_QUERY_TIMEOUT configuration to kill a query if a query is running for more than the configured timeout value.
- [HIVE-4924] - JDBC: Support query timeout for jdbc
- [HIVE-14383] - SparkClientImpl should pass principal and keytab to spark-submit instead of calling kinit explicitely
- [HIVE-12653] - The property "serialization.encoding" in the class "org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe" does not work
- [HIVE-10007] - Support qualified table name in analyze table compute statistics for columns
- [HIVE-13670] - Improve Beeline connect/reconnect semantics
- [HIVE-13033] - SPDO unnecessarily duplicates columns in key & value of mapper output
- [HIVE-14270] - Write temporary data to HDFS when doing inserts on tables located on S3
- [HIVE-13788] - hive msck listpartitions need to make use of directSQL instead of datanucleus
- [HIVE-14118] - Make the alter partition exception more meaningful
- [HIVE-14085] - Allow type widening primitive conversion on hive/parquet tables
- [HIVE-14001] - beeline doesn't give out an error when takes either "-e" or "-f" in command instead of both
- [HIVE-6758] - Beeline doesn't work with -e option when started in background
- [HIVE-12721] - Add UUID built in function
- [HIVE-5193] - Columnar Pushdown for RC/ORC File not happening in HCatLoader
- [HIVE-4570] - More information to user on GetOperationStatus in Hive Server2 when query is still executing
- [HIVE-13420] - Clarify HS2 WebUI Query 'Elapsed TIme'
- [HIVE-12983] - Provide a builtin function to get Hive version
- [HIVE-13797] - Provide a connection string example in beeline
- [HIVE-13616] - Investigate renaming a table without invalidating the column stats
- [HIVE-13696] - Monitor fair-scheduler.xml and automatically update/validate jobs submitted to fair-scheduler
- [HIVE-13783] - Display a secondary prompt on beeline for multi-line statements
- [HIVE-12481] - Occasionally "Request is a replay" will be thrown from HS2
- [HIVE-12958] - Make embedded Jetty server more configurable
- [HIVE-11617] - Explain plan for multiple lateral views is very slow
- [HIVE-13376] - HoS emits too many logs with application state
- [HIVE-13295] - Improvement to LDAP search queries in HS2 LDAP Authenticator
- [HIVE-13171] - Add unit test for hs2 webui
- [HIVE-13244] - Get 'test-query-redactor.json' file on TestQueryRedactor.java from the resources classpath
- [HIVE-13222] - Move rc-file-v0.rc used on TestRCFile.java to src/test/resources
- [HIVE-13206] - Create a test-sources.jar when -Psources profile is invoked
- [HIVE-11512] - Hive LDAP Authenticator should also support full DN in Authenticate()
- [HIVE-12946] - alter table should also add default scheme and authority for the location similar to create table
- [HIVE-10115] - HS2 running on a Kerberized cluster should offer Kerberos(GSSAPI) and Delegation token(DIGEST) when alternate authentication is enabled
- [HIVE-12952] - Show query sub-pages on webui
- [HIVE-10214] - log metastore call timing information aggregated at query level
- [HIVE-12550] - Cache and display last N completed queries in HS2 WebUI
- [HIVE-9774] - Print yarn application id to console [Spark Branch]
- [HIVE-12549] - Display execution engine in HS2 webui query view
- [HIVE-12611] - Make sure spark.yarn.queue is effective and takes the value from mapreduce.job.queuename if given [Spark Branch]
- [HIVE-12811] - Name yarn application name more meaning than just "Hive on Spark"
- [HIVE-12761] - Add stack trace servlet to HS2 web ui
- [HIVE-12431] - Support timeout for compile lock
- [HIVE-12091] - Merge file doesn't work for ORC table when running on Spark. [Spark Branch]
- [HIVE-10027] - Use descriptions from Avro schema files in column comments
- [HIVE-12485] - Secure HS2 web UI with kerberos
- [HIVE-12471] - Secure HS2 web UI with SSL
- [HIVE-12338] - Add webui to HiveServer2
- [HIVE-12713] - Miscellaneous improvements in driver compile and execute logging
- [HIVE-11692] - Fix UT regressions on hbase-metastore branch
- [HIVE-9500] - Support nested structs over 24 levels.
- [HIVE-8769] - Physical optimizer : Incorrect CE results in a shuffle join instead of a Map join (PK/FK pattern not detected)
- [HIVE-10568] - Select count(distinct()) can have more optimal execution plan
- [HIVE-11513] - AvroLazyObjectInspector could handle empty data better
- [HIVE-10119] - Allow Log verbosity to be set in hiveserver2 session
- [HIVE-11638] - ExprNodeDesc hashMap accidentally degrades into O(N) instead of O(1)
- [HIVE-10999] - Upgrade Spark dependency to 1.4 [Spark Branch]
- [HIVE-9906] - Add timeout mechanism in RawStoreProxy
- [HIVE-9819] - Add timeout check inside the HMS server
- [HIVE-9253] - MetaStore server should support timeout for long running requests
- [HIVE-12317] - Emit current database in lineage info
- [HIVE-12235] - Improve beeline logging for dynamic service discovery
- [HIVE-9298] - Support reading alternate timestamp formats
- [HIVE-12265] - Generate lineage info only if requested
- [HIVE-11131] - Get row information on DataWritableWriter once for better writing performance
- [HIVE-12187] - Release plan once a query is executed
- [HIVE-9605] - Remove parquet nested objects from wrapper writable objects
- [HIVE-9333] - Move parquet serialize implementation to DataWritableWriter to improve write speeds
- [HIVE-10048] - JDBC - Support SSL encryption regardless of Authentication mechanism
- [HIVE-10834] - Support First_value()/last_value() over x preceding and y preceding windowing
- [HIVE-11984] - Add HS2 open operation metrics
- [HIVE-9858] - Create cbrt (cube root) UDF
- [HIVE-9480] - Build UDF TRUNC to implement FIRST_DAY as compared with LAST_DAY
- [HIVE-9744] - Move common arguments validation and value extraction code to GenericUDF
- [HIVE-9738] - create SOUNDEX udf
- [HIVE-9556] - create UDF to calculate the Levenshtein distance between two strings
- [HIVE-9596] - move standard getDisplayString impl to GenericUDF
- [HIVE-9594] - Add qtests for LAST_DAY udf
- [HIVE-9590] - add qtests for ADD_MONTHS UDF
- [HIVE-11891] - Add basic performance logging to metastore calls
- [HIVE-11816] - Upgrade groovy to 2.4.4
- [HIVE-10910] - Alter table drop partition queries in encrypted zone failing to remove data from HDFS
- [HIVE-11814] - Emit query time in lineage info
- [HIVE-11763] - Use * instead of sum(hash(*)) on Parquet predicate (PPD) integration tests
- [HIVE-2573] - Create per-session function registry
- [HIVE-10702] - COUNT(*) over windowing 'x preceding and y preceding' doesn't work properly
- [HIVE-10826] - Support min()/max() functions over x preceding and y preceding windowing
- [HIVE-10650] - Improve sum() function over windowing to support additional range formats
- [HIVE-10643] - Refactoring Windowing for sum() to pass WindowFrameDef instead of two numbers (1 for number of preceding and 1 for number of following)
- [HIVE-10658] - Insert with values clause may expose data that should be encrypted
- [HIVE-11504] - Predicate pushing down doesn't work for float type for Parquet
- [HIVE-10319] - Hive CLI startup takes a long time with a large number of databases
- [HIVE-11534] - Improve validateTableCols error message
- [HIVE-9327] - CBO (Calcite Return Path): Removing Row Resolvers from ParseContext
- [HIVE-10166] - Merge Spark branch to master 7/30/2015
- [HIVE-11363] - Prewarm Hive on Spark containers [Spark Branch]
- [HIVE-10252] - Make PPD work for Parquet in row group level
- [HIVE-11139] - Emit more lineage information
- [HIVE-11314] - Print "Execution completed successfully" as part of spark job info [Spark Branch]
- [HIVE-11239] - Test encryption_insert_partition_static.q fails with different output results on other environments
- [HIVE-10717] - Fix failed qtest encryption_insert_partition_static test in Jenkin
- [HIVE-10006] - RSC has memory leak while execute multi queries.[Spark Branch]
- [HIVE-10594] - Remote Spark client doesn't use Kerberos keytab to authenticate [Spark Branch]
- [HIVE-10629] - Dropping table in an encrypted zone does not drop warehouse directory
- [HIVE-10630] - Renaming tables across encryption zones renames table even though the operation throws error
- [HIVE-9143] - select user(), current_user()
- [HIVE-5472] - support a simple scalar which returns the current timestamp
- [HIVE-10527] - NPE in SparkUtilities::isDedicatedCluster [Spark Branch]
- [HIVE-10476] - Hive query should fail when it fails to initialize a session in SetSparkReducerParallelism [Spark Branch]
- [HIVE-10434] - Cancel connection when remote Spark driver process has failed [Spark Branch]
- [HIVE-10302] - Load small tables (for map join) in executor memory only once [Spark Branch]
- [HIVE-8858] - Visualize generated Spark plan [Spark Branch]
- [HIVE-10304] - Add deprecation message to HiveCLI
- [HIVE-10291] - Hive on Spark job configuration needs to be logged [Spark Branch]
- [HIVE-10206] - Improve Alter Table to not initialize Serde unnecessarily
- [HIVE-10146] - Not count session as idle if query is running
- [HIVE-9666] - Improve some qtests
- [HIVE-9871] - Print spark job id in history file [spark branch]
- [HIVE-9659] - 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]
- [HIVE-9882] - Add jar/file doesn't work with yarn-cluster mode [Spark Branch]
- [HIVE-9788] - Make double quote optional in tsv/csv/dsv output
- [HIVE-9823] - Load spark-defaults.conf from classpath [Spark Branch]
- [HIVE-9804] - Turn on some kryo settings by default for Spark [Spark Branch]
- [HIVE-9781] - Utilize spark.kryo.classesToRegister [Spark Branch]
- [HIVE-9793] - Remove hard coded paths from cli driver tests
- [HIVE-9671] - Support Impersonation [Spark Branch]
- [HIVE-9726] - Upgrade to spark 1.3 [Spark Branch]
- [HIVE-9561] - SHUFFLE_SORT should only be used for order by query [Spark Branch]
- [HIVE-9517] - UNION ALL query failed with ArrayIndexOutOfBoundsException [Spark Branch]
- [HIVE-9425] - Add jar/file doesn't work with yarn-cluster mode [Spark Branch]
- [HIVE-9574] - Lazy computing in HiveBaseFunctionResultList may hurt performance [Spark Branch]
- [HIVE-7175] - Provide password file option to beeline
- [HIVE-8379] - NanoTimeUtils performs some work needlessly
- [HIVE-7313] - Allow in-memory/ssd session-level temp-tables
- [HIVE-9192] - One-pass SMB Optimizations [Spark Branch]
- [HIVE-9379] - Fix tests with some versions of Spark + Snappy [Spark Branch]
New Feature
- [HIVE-12270] - Add DBTokenStore support to HS2 delegation token
- [HIVE-8838] - Support Parquet through HCatalog
- [HIVE-16643] - BeeLine tests output should keep the PREHOOK/POSTHOOK Input/Output orderdering
- [HIVE-16556] - Modify schematool scripts to initialize and create METASTORE_DB_PROPERTIES table
- [HIVE-16555] - Add a new thrift API call for get_metastore_uuid
- [HIVE-10924] - add support for MERGE statement
- [HIVE-10329] - Hadoop reflectionutils has issues
- [HIVE-13198] - Authorization issues with cascading views
- [HIVE-13620] - Merge llap branch work to master
- [HIVE-12080] - Support auto type widening (int->bigint & float->double) for Parquet table
- [HIVE-13964] - Add a parameter to beeline to allow a properties file to be passed in
- [HIVE-9302] - Beeline add commands to register local jdbc driver names and jars
- [HIVE-13736] - View's input/output formats are TEXT by default
- [HIVE-10636] - CASE comparison operator rotation optimization
- [HIVE-6791] - Support variable substition for Beeline shell command
- [HIVE-12868] - Fix empty operation-pool metrics
- [HIVE-11785] - Support escaping carriage return and new line for LazySimpleSerDe
- [HIVE-12499] - Add HMS metrics for number of tables and partitions
- [HIVE-11903] - Add lock metrics to HS2
- [HIVE-12271] - Add metrics around HS2 query execution and job submission for Hive
- [HIVE-10985] - Value based windowing on timestamp and double can't handle NULL value
- [HIVE-10906] - Value based UDAF function without orderby expression throws NPE
- [HIVE-10911] - Add support for date datatype in the value based windowing function
- [HIVE-10927] - Add number of HMS/HS2 connection metrics
- [HIVE-10761] - Create codahale-based metrics system for Hive
Task
- [HIVE-11671] - Optimize RuleRegExp in DPP codepath
- [HIVE-16313] - Flaky test: TestBeeLineDriver[drop_with_concurrency]
- [HIVE-14974] - TestBeeLineHistory throws NPE in ShutdownHook
- [HIVE-14461] - Move hbase_bulk to run via TestCliDriver.
- [HIVE-14721] - Fix TestJdbcWithMiniHS2 runtime
- [HIVE-14850] - Fix TestJdbcDriver2 set up time
- [HIVE-14720] - Fix TestJdbcDriver2 runtime
- [HIVE-14559] - Remove setting hive.execution.engine in qfiles
- [HIVE-14973] - Flaky test: TestJdbcWithSQLAuthorization.testBlackListedUdfUsage
- [HIVE-14761] - Remove TestJdbcWithMiniMr after merging tests with TestJdbcWithMiniHS2
- [HIVE-14670] - org.apache.hadoop.hive.ql.TestMTQueries failure
- [HIVE-14552] - TestParseNegative fix
- [HIVE-15881] - Use hive.exec.input.listing.max.threads variable name instead of mapred.dfsclient.parallelism.max
- [HIVE-14956] - Parallelize TestHCatLoader
- [HIVE-14681] - Remove AssertionError for non zero return codes in QTestUtil cleanup
- [HIVE-15008] - Cleanup local workDir when MiniHS2 starts up in FS_ONLY mode
- [HIVE-14891] - Parallelize TestHCatStorer
- [HIVE-13868] - Include derby.log file in the Hive ptest logs
- [HIVE-13603] - Fix ptest unit tests broken by HIVE13505
- [HIVE-15363] - Execute hive-blobstore tests using ProxyLocalFileSystem
- [HIVE-15168] - Flaky test: TestSparkClient.testJobSubmission (still flaky)
- [HIVE-14910] - Flaky test: TestSparkClient.testJobSubmission
- [HIVE-15226] - Add a different masking comment to qtests blobstore output
- [HIVE-15246] - Add a making comment to blobstore staging paths on qtest output
- [HIVE-14753] - Track the number of open/closed/abandoned sessions in HS2
- [HIVE-14810] - Failing test: TestMetaStoreMetrics.testMetaDataCounts
- [HIVE-14817] - Shutdown the SessionManager timeoutChecker thread properly upon shutdown
- [HIVE-14775] - Cleanup IOException usage in Metrics APIs
- [HIVE-14358] - Add metrics for number of queries executed for each execution engine (mr, spark, tez)
- [HIVE-10485] - Create md5 UDF
- [HIVE-9657] - Use new parquet Types API builder to construct data types
- [HIVE-12987] - Add metrics for HS2 active users and SQL operations
- [HIVE-12164] - Remove jdbc stats collection mechanism
- [HIVE-10325] - Remove ExprNodeNullEvaluator
- [HIVE-9520] - Create NEXT_DAY UDF
- [HIVE-9861] - Add spark-assembly on Hive's classpath [Spark Branch]
- [HIVE-9411] - Improve error messages in TestMultiOutputFormat
- [HIVE-9498] - Update golden files of join38 & subquery_in on trunk due to 9327
Test
- [HIVE-14713] - LDAP Authentication Provider should be covered with unit tests
- [HIVE-13798] - Fix the unit test failure org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ivyDownload
- [HIVE-15736] - Add unit tests to Utilities.getInputSummary() method for multi-threading cases
- [HIVE-7375] - Add option in test infra to compile in other profiles (like hadoop-1)
- [HIVE-12621] - PTest Backup additional Tez/Spark logs
- [HIVE-11887] - some tests break the build on a shared machine, can break HiveQA
- [HIVE-13786] - Fix the unit test failure org.apache.hive.service.cli.session.TestHiveSessionImpl.testLeakOperationHandle
- [HIVE-10396] - decimal_precision2.q test is failing on trunk
- [HIVE-12628] - Eliminate flakiness in TestMetrics
- [HIVE-11946] - TestNotificationListener is flaky
- [HIVE-11620] - Fix several qtest output order
Wish
- [HIVE-10427] - collect_list() and collect_set() should accept struct types as argument