GithubHelp home page GithubHelp logo

teradata / presto Goto Github PK

View Code? Open in Web Editor NEW

This project forked from prestodb/presto

93.0 35.0 22.0 204.7 MB

Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data

Home Page: http://www.teradata.com/presto

License: Apache License 2.0

Java 98.66% Makefile 0.03% Python 0.02% HTML 0.21% SQLPL 0.03% PLSQL 0.03% JavaScript 0.74% ANTLR 0.10% Shell 0.10% CSS 0.05% Thrift 0.04%
hadoop sql cloud teradata

presto's Introduction

Presto Build Status

Presto is a distributed SQL query engine for big data.

See the User Manual for deployment instructions and end user documentation.

Requirements

  • Mac OS X or Linux
  • Java 8 Update 92 or higher (8u92+), 64-bit
  • Maven 3.3.9+ (for building)
  • Python 2.4+ (for running with the launcher script)

Building Presto

Presto is a standard Maven project. Simply run the following command from the project root directory:

./mvnw clean install

On the first build, Maven will download all the dependencies from the internet and cache them in the local repository (~/.m2/repository), which can take a considerable amount of time. Subsequent builds will be faster.

Presto has a comprehensive set of unit tests that can take several minutes to run. You can disable the tests when building:

./mvnw clean install -DskipTests

Running Presto in your IDE

Overview

After building Presto for the first time, you can load the project into your IDE and run the server. We recommend using IntelliJ IDEA. Because Presto is a standard Maven project, you can import it into your IDE using the root pom.xml file. In IntelliJ, choose Open Project from the Quick Start box or choose Open from the File menu and select the root pom.xml file.

After opening the project in IntelliJ, double check that the Java SDK is properly configured for the project:

  • Open the File menu and select Project Structure
  • In the SDKs section, ensure that a 1.8 JDK is selected (create one if none exist)
  • In the Project section, ensure the Project language level is set to 8.0 as Presto makes use of several Java 8 language features

Presto comes with sample configuration that should work out-of-the-box for development. Use the following options to create a run configuration:

  • Main Class: com.facebook.presto.server.PrestoServer
  • VM Options: -ea -XX:+UseG1GC -XX:G1HeapRegionSize=32M -XX:+UseGCOverheadLimit -XX:+ExplicitGCInvokesConcurrent -Xmx2G -Dconfig=etc/config.properties -Dlog.levels-file=etc/log.properties
  • Working directory: $MODULE_DIR$
  • Use classpath of module: presto-main

The working directory should be the presto-main subdirectory. In IntelliJ, using $MODULE_DIR$ accomplishes this automatically.

Additionally, the Hive plugin must be configured with location of your Hive metastore Thrift service. Add the following to the list of VM options, replacing localhost:9083 with the correct host and port (or use the below value if you do not have a Hive metastore):

-Dhive.metastore.uri=thrift://localhost:9083

Using SOCKS for Hive or HDFS

If your Hive metastore or HDFS cluster is not directly accessible to your local machine, you can use SSH port forwarding to access it. Setup a dynamic SOCKS proxy with SSH listening on local port 1080:

ssh -v -N -D 1080 server

Then add the following to the list of VM options:

-Dhive.metastore.thrift.client.socks-proxy=localhost:1080

Running the CLI

Start the CLI to connect to the server and run SQL queries:

presto-cli/target/presto-cli-*-executable.jar

Run a query to see the nodes in the cluster:

SELECT * FROM system.runtime.nodes;

In the sample configuration, the Hive connector is mounted in the hive catalog, so you can run the following queries to show the tables in the Hive database default:

SHOW TABLES FROM hive.default;

Developers

We recommend you use IntelliJ as your IDE. The code style template for the project can be found in the codestyle repository along with our general programming and Java guidelines. In addition to those you should also adhere to the following:

  • Alphabetize sections in the documentation source files (both in table of contents files and other regular documentation files). In general, alphabetize methods/variables/sections if such ordering already exists in the surrounding code.
  • When appropriate, use the Java 8 stream API. However, note that the stream implementation does not perform well so avoid using it in inner loops or otherwise performance sensitive sections.
  • Categorize errors when throwing exceptions. For example, PrestoException takes an error code as an argument, PrestoException(HIVE_TOO_MANY_OPEN_PARTITIONS). This categorization lets you generate reports so you can monitor the frequency of various failures.
  • Ensure that all files have the appropriate license header; you can generate the license by running mvn license:format.
  • Consider using String formatting (printf style formatting using the Java Formatter class): format("Session property %s is invalid: %s", name, value) (note that format() should always be statically imported). Sometimes, if you only need to append something, consider using the + operator.
  • Avoid using the ternary operator except for trivial expressions.
  • Use an assertion from Airlift's Assertions class if there is one that covers your case rather than writing the assertion by hand. Over time we may move over to more fluent assertions like AssertJ.
  • When writing a Git commit message, follow these guidelines.

presto's People

Contributors

alandpost avatar arhimondr avatar arturgajowy avatar cawallin avatar cberner avatar dain avatar ebyhr avatar electrum avatar erichwang avatar fiedukow avatar findepi avatar geraint0923 avatar haozhun avatar highker avatar joyyao8 avatar kokosing avatar losipiuk avatar maciejgrzybek avatar martint avatar miniway avatar nezihyigitbasi avatar nileema avatar pnowojski avatar raghavsethi avatar rschlussel-zz avatar sopel39 avatar wenleix avatar yaoxin226 avatar yuananf avatar zhenxiao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

presto's Issues

Clustered Hive tables support????

I installed presto_server_pkg.0.167-t.0.2 on the cento7.2.
According to the document installed presto-admin,and the coordinator and workers are on a machine。

  1. Data Sources
    flume-->hive
    hive table:create table test (bytes_in int,bytes_out int,device_id string,device_type string,host string,latency int,level string,method string,msg string,path string,referer string,remote_ip string,response_code int,route string,status int,type string,uri string,user_agent string,user_id bigint,time string)
    PARTITIONED BY(year string,month string,day string)
    clustered by (user_id) into 5 buckets
    stored as orc
    TBLPROPERTIES ("transactional"="true");

  2. hdfs path
    /user/hive/warehouse/logs.db/test/year=2017/month=10/day=26/delta_0018401_0018500/bucket_0000*
    According to the document,I have set hive.multi-file-bucketing.enabled config property in the hive.properties and set session property.
    presto:logs> set session hive.multi_file_bucketing_enabled = true;
    SET SESSION
    but,through the use of presto query error:
    presto:logs> select count(*) from test;
    Query 20171026_065023_00034_4pi6j failed: Hive table is corrupt. It is declared as being bucketed, but the files do not match the bucketing declaration. Found sub-directory in bucket directory for partition: year=2017/month=10/day=26

According to the document,
https://teradata.github.io/presto/docs/current/release/release-0.167-t.html
Fix issue “Hive table is corrupt. It is declared as being bucketed, but the files do not match the bucketing declaration. The number of files in the directory (1) does not match the declared.” by fixing support for Hive bucketed tables. See option hive.multi-file-bucketing.enabled in the Presto Hive connector documentation.

but,,,,,,,
who can help me????????

Can't find /etc/presto/event-listener.properties with default settings

Dear team,

I made my own event-listener, and put the event-listener.properties in /etc/presto/
However Presto can't find event-listener.properties.
Also, the log.properties is not read by presto because there is no setting to use it in presto-server-rpm.

I checked the launcher.py , and found it is always checking files under [INSTALL_FOLDER]/etc,(launcher.py)

By presto-admin, it is putting all the properties files in /etc/presto/,
So I think it is better to create a symbolic link in presto-server-rpm/postinstall, how do you think about this?
https://github.com/prestodb/presto/blob/master/presto-server-rpm/src/main/rpm/postinstall#L21

ln -snf /etc/presto/ /usr/lib/presto/etc

How can I allow multiple presto coordinator's urls from NGINX

I wanna allow two presto urls from nginx. bellow is the configuration of nginx.
presto-1 IP :172.18.0.14
presto-2 IP :172.18.0.12
but only one presto coordinator is running at a time. I wanna allow both presto working.

Below is the content of this file /etc/nginx/conf.d/ssl.conf :

server {
listen 443 ssl;
server_name _;

ssl_certificate             /root/server.crt;
ssl_certificate_key         /root/server.key;
ssl_protocols               TLSv1 TLSv1.1 TLSv1.2;
ssl_ciphers                 HIGH:!aNULL:!MD5;

# Load configuration files for the default server block.
include /etc/nginx/default.d/*.conf;

location /presto-1/{
proxy_http_version 1.1;
proxy_set_header Upgrade $http_upgrade;
proxy_set_header Connection "upgrade";
proxy_pass https://172.18.0.14:8443/;
}
location /presto-2/{
proxy_http_version 1.1;
proxy_set_header Upgrade $http_upgrade;
proxy_set_header Connection "upgrade";
proxy_pass https://172.18.0.12:8443/;
}
location /v1/query/{
proxy_pass https://172.18.0.14:8443/v1/query/;
}
location /v1/cluster/{
proxy_pass https://172.18.0.14:8443/v1/cluster/;
}
location /v1/info/{
proxy_pass http://172.18.0.14:9090/v1/info;
}
}

Release notes for 0.179-t

Generated by git log 0.179..release-0.179-t --no-decorate --format="- [ ] %an (committed by %cn) %H %s" | sort + manually dividing up the sections

Akshat Nair

  • Akshat Nair (committed by Akshat Nair) 00fe485 Add unit tests for DetermineSemiJoinDistributionType
  • Akshat Nair (committed by Akshat Nair) 9a7dfaf Get the correct distribution type for ROJ and LOJ
  • Akshat Nair (committed by Akshat Nair) a25882a Remove costing for DetermineSemiJoinDistributionType
  • Akshat Nair (committed by Akshat Nair) d4698f3 Add options information in the CLI documentation
  • Akshat Nair (committed by Akshat Nair) ddf2a39 Fix PlanPrinter formatting
  • Akshat Nair (committed by Akshat Nair) e103b6e Update doc for Cost in Explain
  • Akshat Nair (committed by Akshat Nair) fd40256 Migrate DeteremineJoinType to Iterative optimizer
  • Akshat Nair (committed by Rebecca Schlussel) 36b54a8 Document current rule based optimizer
  • Nezih Yigitbasi (committed by Akshat Nair) b75ed52 Fix formatting in EXPLAIN ANALYZE output

Alan Post

  • Alan Post (committed by Alan Post) 33158e6 Merge branch 'support_ldap_authentication_between_nodes_rebase_59' into sprint-59
  • Alan Post (committed by Alan Post) 7d173f8 Don't ask Hive for statistics for hidden columns
  • Alan Post (committed by Alan Post) d99ae36 Fix InlineProjections on subqueries

Amruta Gokhale

  • Amruta Gokhale (committed by Amruta Gokhale) 3f8070f Add tests for selectivity estimates for parts of TPC-H queries
  • Amruta Gokhale (committed by Amruta Gokhale) 5a1804a Quarantine test selectFromPartitionedNation on EMR
  • Amruta Gokhale (committed by Amruta Gokhale) 9988a19 Revert "Quarantine failing product-test"
  • Amruta Gokhale (committed by Amruta Gokhale) b21b650 Modify tests to account for an additional ExchangeNode in the plan
  • Amruta Gokhale (committed by Amruta Gokhale) bb3e5e0 Add a configuration option to disable creating non-managed Hive table
  • Amruta Gokhale (committed by Amruta Gokhale) ca06cb1 Document note about authorization without authentication
  • Amruta Gokhale (committed by Amruta Gokhale) d2a8eb9 Minor cleanup in event listener tests
  • Amruta Gokhale (committed by Christina Wallin) 703634c Wait longer for schema agreement in Cassandra tests
  • Amruta Gokhale (committed by Christina Wallin) 74e5f8f Retry when retrieving metadata in Cassandra tests

Andrii Rosa

  • Andrii Rosa (committed by Alan Post) 5d0af1f Non interactive presto-cli password
  • Andrii Rosa (committed by Alan Post) a60f3ae Secured internal communication with LDAP product tests
  • Andrii Rosa (committed by Alan Post) ac1ab24 Document LDAP secured internal communication
  • Andrii Rosa (committed by Alan Post) ef94661 Implement LDAP authentication for internal communication
  • Andrii Rosa (committed by Amruta Gokhale) 0194f7f Simplify checkTablePermission
  • Andrii Rosa (committed by Amruta Gokhale) 0ed59fd Move parsePrivilege to MetastoreUtil
  • Andrii Rosa (committed by Amruta Gokhale) 134a017 Introduce APPLICABLE_ROLES view
  • Andrii Rosa (committed by Amruta Gokhale) 17570c6 Remove redundant checkDatabasePermission methods
  • Andrii Rosa (committed by Amruta Gokhale) 1a2f9c2 Refactor HivePrivilegeInfo
  • Andrii Rosa (committed by Amruta Gokhale) 1ed43c8 Introduce CREATE ROLE and DROP ROLE statements
  • Andrii Rosa (committed by Amruta Gokhale) 266b2c5 Speedup TestHiveFileBasedSecurity
  • Andrii Rosa (committed by Amruta Gokhale) 3094513 Reorder methods in HivePrivilegeInfo
  • Andrii Rosa (committed by Amruta Gokhale) 345352c Store catalog selected roles in Identity
  • Andrii Rosa (committed by Amruta Gokhale) 35da07e Introduce ConnectorIdentity
  • Andrii Rosa (committed by Amruta Gokhale) 4737448 Implement SET ROLE in Hive Connector
  • Andrii Rosa (committed by Amruta Gokhale) 473ba04 Expose Create/Drop/List roles methods in SPI
  • Andrii Rosa (committed by Amruta Gokhale) 4e10da7 Remove unused InMemoryHiveMetastore
  • Andrii Rosa (committed by Amruta Gokhale) 4ffea19 Rename getGrantOptionForPrivilege to hasGrantOptionForPrivilege
  • Andrii Rosa (committed by Amruta Gokhale) 5a91f22 Catalog access control for roles
  • Andrii Rosa (committed by Amruta Gokhale) 60454eb Prepare metastore interface to accept ROLE for GRANT/REVOKE
  • Andrii Rosa (committed by Amruta Gokhale) 6206ecb More product tests for SET ROLE
  • Andrii Rosa (committed by Amruta Gokhale) 6ab9036 Add grantor to HivePrivilegeInfo
  • Andrii Rosa (committed by Amruta Gokhale) 7fca572 Implement SET ROLE
  • Andrii Rosa (committed by Amruta Gokhale) 807b8e1 Reorder methods in SqlStandardAccessControl
  • Andrii Rosa (committed by Amruta Gokhale) 808d5d6 Consider enabled roles for permissions
  • Andrii Rosa (committed by Amruta Gokhale) 80bd7b2 Implement Grant/Revoke/ListApplicableRoles in Hive
  • Andrii Rosa (committed by Amruta Gokhale) 819df95 Introduce .information_schema.roles table
  • Andrii Rosa (committed by Amruta Gokhale) 8433700 Introduce SET ROLE statement
  • Andrii Rosa (committed by Amruta Gokhale) 89091e1 Add grantor_type and grantee_type columns to table_privileges
  • Andrii Rosa (committed by Amruta Gokhale) 9c0b61e Allow all for admin role
  • Andrii Rosa (committed by Amruta Gokhale) a561d98 Refactor GRANT/REVOKE in Hive
  • Andrii Rosa (committed by Amruta Gokhale) afb23c3 Introduce ENABLED_ROLES view
  • Andrii Rosa (committed by Amruta Gokhale) bbde32c Introduce GRANT/REVOKE roles statements
  • Andrii Rosa (committed by Amruta Gokhale) c555471 Assign admin role to subset of users in FileHiveMetastore
  • Andrii Rosa (committed by Amruta Gokhale) c77e9b9 Force an exchange to avoid potential coordinator overload
  • Andrii Rosa (committed by Amruta Gokhale) d2fc4b3 Accept ROLE in GRANT/REVOKE Privileges statements
  • Andrii Rosa (committed by Amruta Gokhale) d69272c Remove hive privilege null check
  • Andrii Rosa (committed by Amruta Gokhale) dc1ff6d Introduce access control for GRANT/REVOKE ROLE
  • Andrii Rosa (committed by Amruta Gokhale) dd38d52 Implement Create/Drop/List roles in Hive connector
  • Andrii Rosa (committed by Amruta Gokhale) e48b56b Add Grant/Revoke/List roles authorization to the SPI
  • Andrii Rosa (committed by Amruta Gokhale) e8c03f8 Introduce isTableOwner method for readability
  • Andrii Rosa (committed by Amruta Gokhale) f2376a8 Move PrincipalType to presto-spi
  • Andrii Rosa (committed by Amruta Gokhale) fea21e6 Refactor canCreateView security checks
  • Andrii Rosa (committed by Andrii Rosa) 9d83483 Consider local exchange in cost calculators
  • Andrii Rosa (committed by Anton Petrov) 854c7b0 Clear interrupt flag after exiting from pager
  • Andrii Rosa (committed by Christina Wallin) 2e20eaa Add property that allows to disable sort redistribution
  • Andrii Rosa (committed by Christina Wallin) 30accd0 Use binary heap in MergeSort algorithm
  • Andrii Rosa (committed by Christina Wallin) 5ba2045 Compile merge sort comparator
  • Andrii Rosa (committed by Christina Wallin) 704a971 Add product test that verifies HTTP port is closed on multinode-tls
  • Andrii Rosa (committed by Christina Wallin) 878d923 Implement MergeSortProcessor
  • Andrii Rosa (committed by Christina Wallin) 89cf4be Add product tests suite for secured internal communication
  • Andrii Rosa (committed by Christina Wallin) 8a2dd96 Support secure internal communication
  • Andrii Rosa (committed by Christina Wallin) 93f413e Process pending reads upon noMorePages event
  • Andrii Rosa (committed by Christina Wallin) 9592366 AbstractTestQueries for Distributed sort
  • Andrii Rosa (committed by Christina Wallin) b6bdaaf Temporary workaround for providing keystore for Presto JDBC
  • Andrii Rosa (committed by Christina Wallin) c6e02a2 Implement local merge sort
  • Andrii Rosa (committed by Christina Wallin) d233e84 Implement MergeOperator
  • Andrii Rosa (committed by Christina Wallin) f4da555 Document encryption performance impact
  • Andrii Rosa (committed by Christina Wallin) fbae5b7 Separate product tests into 2 builds on travis
  • Andrii Rosa (committed by Maciej Grzybek) 5d95cb7 Pass list of projection columns to SplitManager
  • Andrii Rosa (committed by Maciej Grzybek) 761e666 Pass list of projection columns to ConnectorSplitManager
  • Andrii Rosa (committed by Maciej Grzybek) 77f0e3c Fix failing tests in presto-main
  • Andrii Rosa (committed by Maciej Grzybek) 99ed9db Print window node specific statistics in explain analyze
  • Andrii Rosa (committed by Maciej Grzybek) 9ab3ecb Log tests executed by AbstractTestQueryFramework
  • Andrii Rosa (committed by Maciej Grzybek) b0243e2 Treat fixed pointer literals as DECIMAL
  • Andrii Rosa (committed by Maciej Grzybek) d1bcac1 Add decimal as default literal unit tests
  • Andrii Rosa (committed by Maciej Grzybek) f5ac832 Add "parse decimal literal as double" legacy switch
  • Andrii Rosa (committed by anusudarsan) 8a462b5 Support _HOST placeholder in Kerberos internal communication

Andrzej Fiedukowicz

  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 0307542 Add stats calculation for FilterNode logical operations
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 03a0ffc Split ErrorMessages into regions
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 06b53ab Add FilterStatsCalculator
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 0d5d1e2 Add new TIME/TIMESTAMP parsing/printing to DateTimeUtils
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 0f08d1a Add config and session property deprecated.legacy-timestamp
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 111ebec Add default error message for Exceptions on CLI side
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 14a2975 Introduce CLI custom message for Server starting up/shutting down error
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 163bbb7 Fix current_time, localtime & localtimestamp semantics
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 16b744d Introduce CLI custom message for server not found error
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 2e506e2 Add tests covering Time types representation
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 3938797 Fix at_timezone(TIME WITH TIME ZONE)
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 3e33814 Empty commit for triggering build
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 403cb52 Split TestDateTimeFunctions into variants based on isLegacyTimestamp
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 4100946 Introduce new TIMESTAMP semantics to to_iso8601 scalar
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 4a39f12 fixup! Migrate MetadataQueryOptimizer to iterative optimizer
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 4dab122 Add isLegacyTimestamp flag to ConnectorSession
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 524f456 Add new date time semantics to date time cast operators
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 544d068 Fix references in docs
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 594d8b5 Use new stats calculator by default
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 5e87fe5 Make travis upload PR artifacts to different directory
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 5f32dde Introduce CLI custom message for 404 http status error
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 5fc709d Change default TIME/TIMESTAMP semantics to ANSI SQL
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 65a45ad Fix current_time timezone offset
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 6989a50 Introduce new TIME/TIMESTAMP semantics SPI types
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 6b60799 Split TestDateTimeOperators into variants based on isLegacyTimestamp
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 86a0a1c Add CLI --quiet option allowing to remove tips from error messages
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 88b21d3 Quarantine failing product-test
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 8de1b63 Always use fixed TIME as session base in TestDateTimeFunctions
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 9b30960 Add stats calculation for FilterNode comparison related operators
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 9be41be Add rule for computing stats for FilterNode
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) 9f54334 Introduce new TIME/TIMESTAMP semantics to scalar functions
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) ad14ce3 Add stats calculation for FilterNode boolean expressions
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) ad97b55 fixup! Add UnionStatsRule
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) ae13afc Add stats calculation for FilterNode comparisons
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) b12c275 Add new TIME/TIMESTAMP semantic variant to Expression parsing
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) bdec222 Add more detailed description of Hive connector properties
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) c6b8bc7 Add query properties description
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) cca9c71 Add basic comments to ErrorMessages class
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) cfdb6a2 Add session properties description
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) d009bcf Add stats calculation for FilterNode is (not) null expression
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) d726bbc Extract ErrorMessages class for CLI readable user messages creation
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) d90a281 Introduce new TIME/TIMESTAMP semantics to SQL types
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) e9aeaae Make product-tests quiet about CLI errors
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) fb527f9 Unify --debug output for all errors in CLI
  • Andrzej Fiedukowicz (committed by Andrzej Fiedukowicz) fd3f208 Introduce DomainConverted

Anton Petrov

  • Anton Petrov (committed by Anton Petrov) 759fe32 Add documentation for the TPC-DS connector
  • Anton Petrov (committed by Anton Petrov) aae372c Unquarantine TPC-DS q27, q36, q70 and q86
  • Anton Petrov (committed by Anton Petrov) b2cedb0 fixup! update version
  • Anton Petrov (committed by Anton Petrov) bc0c6c8 Remove stale comments on TPC-DS q27, q36, q70, q86
  • Anton Petrov (committed by Anton Petrov) c57d247 Add the metadata portion of the TPC-DS connector
  • Anton Petrov (committed by Anton Petrov) d5d266b Minor re-writes to TPC-DS q36, q70 and q86
  • James Sun (committed by Anton Petrov) 36c6bdc Fix native memory leak in RCFile writer
  • Min(Dongmin Yu) (committed by Anton Petrov) 77ade86 Add missing JsonCreator annotation
  • Nezih Yigitbasi (committed by Anton Petrov) de04852 Remove unused fields

Anu Sudarsan

  • Anu Sudarsan (committed by anusudarsan) 0466ebe Add profile for testing Kerberos secured internal communication
  • Anu Sudarsan (committed by anusudarsan) 04e4c33 Document Kerberos secured internal communication
  • Anu Sudarsan (committed by anusudarsan) 0d79db3 Add tests infrastructure for Kerberos secured internal communication
  • Anu Sudarsan (committed by anusudarsan) e5b55bf Support for Kerberos secured internal communication
  • Anu Sudarsan (committed by Łukasz Osipiuk) 45a359e Add product-tests for bucketed hive tables
  • anusudarsan (committed by Brian Rickman) 17cc4ee Fix the http client used in tests to use kerberos credentials
  • anusudarsan (committed by anusudarsan) 0a9a809 Migrate OptimizeMixedDistinctAggregations to iterative optimizer
  • anusudarsan (committed by anusudarsan) 2d9eb84 fixup! Check violation
  • anusudarsan (committed by anusudarsan) 7929b91 Migrate MetadataQueryOptimizer to iterative optimizer
  • anusudarsan (committed by anusudarsan) ac4e24b Fix incorrect results bug in legacy MetadataQueryOptimizer

Artur Gajowy

  • Artur Gajowy (committed by Andrzej Fiedukowicz) 028d7f4 Add Optionals.{checkPresent,withBoth,combine} util methods
  • Artur Gajowy (committed by Andrzej Fiedukowicz) 126b767 Support column stats in TPCH connector
  • Artur Gajowy (committed by Andrzej Fiedukowicz) 5d33a99 Change argument type in TpchMetadata.getPrestoType to TpchColumn
  • Artur Gajowy (committed by Andrzej Fiedukowicz) 8752563 Add Types.{checkType,checkSameTypes,tryCast} util methods
  • Artur Gajowy (committed by Andrzej Fiedukowicz) b96752f Add RecordTpchTableStatsTool for recording stats summaries in .json
  • Artur Gajowy (committed by Andrzej Fiedukowicz) c10c329 Add EstimateAssertion
  • Artur Gajowy (committed by Andrzej Fiedukowicz) cec3dd7 Add Constraint.alwaysFalse() method
  • Artur Gajowy (committed by Andrzej Fiedukowicz) cfde85a Add TestTpchMetadata
  • Artur Gajowy (committed by Andrzej Fiedukowicz) dc65d16 Add statistics recordings for tpch.{tiny,sf1}
  • Artur Gajowy (committed by Artur Gajowy) 254bd6e fixup! Remove explicit data_size from PlanNodeStatsEstimate
  • Artur Gajowy (committed by Artur Gajowy) 27a904f Make unknown costs the default in PlanNodeCostEstimate.Builder
  • Artur Gajowy (committed by Artur Gajowy) 325a51c Mention spill for join in release notes
  • Artur Gajowy (committed by Artur Gajowy) 42cb285 Add 0.179-t release notes placeholder
  • Artur Gajowy (committed by Artur Gajowy) 5353ee2 Mention spill for join in properties documentation
  • Artur Gajowy (committed by Artur Gajowy) 6d3633c fixup! Introduce CostCalculator interface
  • Artur Gajowy (committed by Artur Gajowy) 6e0e66a Add TestCostCalculator.CostAssertionBuilder#cpu/network/memoryUnknown()
  • Artur Gajowy (committed by Artur Gajowy) 70005e6 Quarantine TestReorderJoins tests depending on cost calculation details
  • Artur Gajowy (committed by Artur Gajowy) 7d27870 fixup! Introduce CostComparator
  • Artur Gajowy (committed by Artur Gajowy) aa64650 fixup! Introduce CostCalculator interface
  • Artur Gajowy (committed by Artur Gajowy) ca886de fixup! Introduce CostCalculator interface
  • Artur Gajowy (committed by Artur Gajowy) cf0d369 Replace the 'experimental' word with 'beta' when referring to spill
  • Artur Gajowy (committed by Artur Gajowy) dda08fc Test that CostCalculator successfully returns unknown costs for no stats
  • Artur Gajowy (committed by Artur Gajowy) deb7fbd fixup! Add ReorderJoins rule to pick the best join order
  • Artur Gajowy (committed by Karol Sobczak) fab27b6 fixup! Remove explicit data_size from PlanNodeStatsEstimate
  • Artur Gajowy (committed by Łukasz Osipiuk) 2af6299 Add TestDistributedSpilledQueries
  • Artur Gajowy (committed by Łukasz Osipiuk) 4033e33 Add getAllSpilledPages method to SingleStreamSpiller
  • Artur Gajowy (committed by Łukasz Osipiuk) 8d79e02 Introduce PartitionedConsumption
  • Artur Gajowy (committed by Łukasz Osipiuk) 9e26e9e Check no write after read in FileSingleStreamSpiller

Brian Rickman

  • Brian Rickman (committed by Brian Rickman) 0c01a4c Support running tests with alternative JDBC drivers
  • Brian Rickman (committed by Brian Rickman) 145c4aa Add tests for GRANT and REVOKE
  • Brian Rickman (committed by Brian Rickman) 2e34c79 Export ports used by Kerberos services This facilitates testing clients that run on the host machine.
  • Brian Rickman (committed by Brian Rickman) 34e7a28 Fix Simba specific JDBC tests
  • Brian Rickman (committed by Brian Rickman) 3b35b61 Test PreparedStatement * Test INSERT with all supported data types * Test SELECT * Test PreparedStatement SQL syntax
  • Brian Rickman (committed by Brian Rickman) 6bce71e Quarantine testRevokeRoleFromUser until the Presto bug it demonstrates is fixed.
  • Brian Rickman (committed by Brian Rickman) 8bb3e95 Update expected error message for LdapTests.shouldFailForUserWithColon()
  • Brian Rickman (committed by Brian Rickman) 8c57520 Fix TestAllDatatypesFromHiveConnector.testSelectAllDatatypesParquetFile for Simba JDBC
  • Brian Rickman (committed by Brian Rickman) 8fe6524 Expand testRevokeRoleFromUser The test now verifies that prilileges from the role are not available to the user after the role has been revoked.
  • Brian Rickman (committed by Brian Rickman) 94f671b Fix test failures for Simba JDBC driver
  • Brian Rickman (committed by Brian Rickman) a70f27f Refactor with static string CREATE_TABLE_SQL
  • Brian Rickman (committed by Brian Rickman) bb5d62c Package product-test scripts
  • Brian Rickman (committed by Brian Rickman) bcedef4 New SSL certificate to replace expired certificate.
  • Brian Rickman (committed by GitHub) 9cb4e1b Merge pull request #673 from Teradata/docker-cluster-ssl-certificate
  • Brian Rickman (committed by GitHub) ec38a04 Merge pull request #672 from Teradata/docker-kerberos-ports
  • Brian Rickman (committed by Łukasz Osipiuk) 9bd6da6 Fix checkstyle issue

Christina Wallin

  • Christina Wallin (committed by Alan Post) de2a937 Add a note about LDAP service user for secure communication
  • Christina Wallin (committed by Amruta Gokhale) 1509ab4 Add SHOW CURRENT ROLES
  • Christina Wallin (committed by Amruta Gokhale) 19513d0 Implement listRoleGrants() in Hive
  • Christina Wallin (committed by Amruta Gokhale) 2ab7ae5 Add SHOW ROLE GRANTS syntax
  • Christina Wallin (committed by Amruta Gokhale) 31ff122 Add access control checks for SHOW ROLES
  • Christina Wallin (committed by Amruta Gokhale) 5583243 Product tests for SHOW ROLES
  • Christina Wallin (committed by Amruta Gokhale) 6ba87d2 Add SHOW ROLES to the parser
  • Christina Wallin (committed by Amruta Gokhale) 7558fa6 Add listRoleGrants to the SPI
  • Christina Wallin (committed by Amruta Gokhale) a5b9ad2 Implement SHOW ROLE GRANTS rewrite
  • Christina Wallin (committed by Amruta Gokhale) bc501c8 Add docs for SHOW ROLES
  • Christina Wallin (committed by Amruta Gokhale) c24a04c Add docs for SHOW ROLE GRANTS
  • Christina Wallin (committed by Amruta Gokhale) cc1a626 Access control for SHOW ROLE GRANTS and SHOW CURRENT ROLES
  • Christina Wallin (committed by Amruta Gokhale) f1f1e9e Rewrite SHOW ROLES as a select query
  • Christina Wallin (committed by Christina Wallin) 000c92e Add TableLayout to the TableScanMatcher
  • Christina Wallin (committed by Christina Wallin) 1abe853 Remove trailing spaces from CHAR function results
  • Christina Wallin (committed by Christina Wallin) 1ad4176 Pass prepared statements in HTTP request body
  • Christina Wallin (committed by Christina Wallin) 1fa8b77 Update docker images version to 19
  • Christina Wallin (committed by Christina Wallin) 23aaf0f Unquarantine TPC-DS q85 because SUBSTR(CHAR) now works
  • Christina Wallin (committed by Christina Wallin) 2f48e86 Extract HttpClientHandler from TestExchangeOperator
  • Christina Wallin (committed by Christina Wallin) 32f6848 Add docs for Kerberos principal access control
  • Christina Wallin (committed by Christina Wallin) 4329b18 fixup! Update MultiJoinNodeBuilder to limit the number of sources
  • Christina Wallin (committed by Christina Wallin) 58fa0b8 Migrate PickLayout to the iterative optimizer
  • Christina Wallin (committed by Christina Wallin) 5b19ec3 Add ordering to RemoteSourceNode and ExchangeNode
  • Christina Wallin (committed by Christina Wallin) 89947ad Add Kerberos exact match to file access control
  • Christina Wallin (committed by Christina Wallin) 8e939c3 Planner part of Merge
  • Christina Wallin (committed by Christina Wallin) 9b8fd28 Expose extra information from RuleTester
  • Christina Wallin (committed by Christina Wallin) aa52225 Extract TableLayout rewrite to a separate class
  • Christina Wallin (committed by Christina Wallin) b51d6c4 Add tests for long prepared statements
  • Christina Wallin (committed by Christina Wallin) b8de2af Update MultiJoinNodeBuilder to limit the number of sources
  • Christina Wallin (committed by Christina Wallin) c5b09aa Modify CLI to support long prepared statements
  • Christina Wallin (committed by Christina Wallin) c65de0b Support long prepared statements in QueryResults
  • Christina Wallin (committed by Christina Wallin) d12fc3d Prepare for next development iteration
  • Christina Wallin (committed by Christina Wallin) e3e1bc0 Add TableLayout to PlanBuilder
  • David Phillips (committed by Christina Wallin) 0d5a059 Use older Travis image
  • David Phillips (committed by Christina Wallin) 312f0b2 Allow configuring JDBC HTTP client per connection
  • David Phillips (committed by Christina Wallin) 3dd914f Extract base class for LDAP JDBC tests
  • David Phillips (committed by Christina Wallin) 4096870 Use OkHttp for CLI and JDBC driver
  • David Phillips (committed by Christina Wallin) 5c5a9c9 Simplify assertions in TestPrestoDriverUri
  • David Phillips (committed by Christina Wallin) 83506ff Add Kerberos support for JDBC driver
  • David Phillips (committed by Christina Wallin) b0c1fe1 Add SSL LDAP product tests for Presto JDBC driver
  • David Phillips (committed by Christina Wallin) ceee973 Add HTTP proxy support for JDBC driver and CLI
  • David Phillips (committed by Christina Wallin) efc80a3 Add SOCKS proxy support for JDBC driver
  • Eric Diven (committed by Christina Wallin) 2eca484 Add support for SSL in JDBC driver
  • Eric Diven (committed by Christina Wallin) 51f0b8f Improve properties/URL parameter handling
  • Eric Diven (committed by Christina Wallin) e6ea41b Document encrypted internal communication

Grzegorz Kokosinski

  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 0e7b37e Use Set collection to store stats estimation rules
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 1023df2 Use pattern matching in ComposableStatsCalculator
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 11812a0 Add UnionStatsRule
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 1cfdc4f Cap distinct values count to ouptut rows count
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 4fa9c7c Make StatsCalculaterTester to be closeable
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 67b29c8 Ensure all output symbols have stats estimates
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 68e0993 Introduce stats Normalizer
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 6f8b774 Remove display-only stats tests
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 71d5c4e Support multi source exchange
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 787511e Add scalar stats estimation for CoalesceExpression
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 8684b49 Cap distinct values count to type domain range length
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) 8d05a3f Use TreeTraverser in MatchingEngine
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) acba057 Use nearlyEquals for estimate comparison
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) cb0e313 Add statistic estimation for simple AggregationNode
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) d06d384 Test for cost/stats calculation done by CoefficientBasedStatsCalculator
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) d35f1d7 Add scalar stats estimation for ArithmeticBinaryExpression
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) d8a55cb Move pattern matching to separate package
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) e58f0a2 Check interfaces of given object when checking pattern
  • Grzegorz Kokosiński (committed by Andrzej Fiedukowicz) ffcb0eb Add mapping functions to SymbolStatsEstimate
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) 063d3ba fixup! Add stats calculation for FilterNode comparisons
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) 1f1fb86 fixup! Introduce CostCalculator interface
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) 5431913 Add test for join ordering for part of TPCH q2
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) 7410860 fixup! Introduce CostCalculator interface
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) 79ede59 fixup! Introduce caching cost and stats calculator
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) a77194f Rename TestReorderJoins to TestElminateCrossJoins
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) ada21fb Add IntersectStatsRule
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) e46571d fixup! Introduce CostCalculator interface
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) ee9b9f2 fixup! Add UnionStatsRule
  • Grzegorz Kokosiński (committed by Grzegorz Kokosiński) ff66793 fixup! Add JoinStatsRule to support equi-conditions and filters
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) 0bb5981 Expose TPCDS statistics
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) 4ab5e1a Check checksum of LookupSourceSupplier in HashBuilderOperator
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) 59034a4 Port TPCH statistics recorder to TPCDS
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) 7e0e2f7 Calculate checksum of spilled LookupSource
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) 87abbea Extend and refactor TestReorderJoins to verify just join order
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) ad4673e Close spill files as soon as they are read
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) b1b9858 TPCDS statistics data
  • Grzegorz Kokosiński (committed by Łukasz Osipiuk) c102dd7 Return TPCDS statistics values in Presto internal representation

Karol Sobczak

  • Karol Sobczak (committed by Akshat Nair) 532e068 Fix the TODO in LimitStatsRule
  • Karol Sobczak (committed by Andrzej Fiedukowicz) 5cef6b1 Add verifyExactColumnStatistics utility method to StatisticsAssertion
  • Karol Sobczak (committed by Andrzej Fiedukowicz) bc5fd28 Add JoinStatsRule to support equi-conditions and filters
  • Karol Sobczak (committed by Karol Sobczak) 014853b Adjust PushPartialAggregationThroughExchange to sprint branch
  • Karol Sobczak (committed by Karol Sobczak) 0e7ae58 Migrate PushPartialAggregationThroughJoin as a rule
  • Karol Sobczak (committed by Karol Sobczak) 0fd980c Migrate PushPartialAggregationThroughExchange as a rule
  • Karol Sobczak (committed by Karol Sobczak) 12285d6 fixup! Introduce CostCalculator interface
  • Karol Sobczak (committed by Karol Sobczak) 13943d9 Handle symbol = symbol explicitly in stats
  • Karol Sobczak (committed by Karol Sobczak) 29d87e1 Adjust PushPartialAggregationThroughJoin to sprint branch
  • Karol Sobczak (committed by Karol Sobczak) 30596a5 Skip PushPartialAggregationThroughJoin when hash symbol is present
  • Karol Sobczak (committed by Karol Sobczak) 3915b3c Preserve symbols required by join during partial aggregation pushdown
  • Karol Sobczak (committed by Karol Sobczak) 4fb2b49 fixup! Support multi source exchange
  • Karol Sobczak (committed by Karol Sobczak) 52a56db fixup! Stats calculator for simple Filter+Project+SemiJoin
  • Karol Sobczak (committed by Karol Sobczak) 59e04c6 Restrict output of join after partial aggregation pushdown
  • Karol Sobczak (committed by Karol Sobczak) 5ea92f3 fixup! Add JoinStatsRule to support equi-conditions and filters
  • Karol Sobczak (committed by Karol Sobczak) 708e37b Introduce caching cost and stats calculator
  • Karol Sobczak (committed by Karol Sobczak) 813d19f Adjust PlanOptimizers to sprint branch
  • Karol Sobczak (committed by Karol Sobczak) 82368dd Remove remapping of pushed partial aggregation
  • Karol Sobczak (committed by Karol Sobczak) 89ca2ff fixup! Cap distinct values count to ouptut rows count
  • Karol Sobczak (committed by Karol Sobczak) 8a8736b Add rule test for PushPartialAggregationThroughJoin
  • Karol Sobczak (committed by Karol Sobczak) 994af68 Remove @threadsafe annotation from CostCalculator interface
  • Karol Sobczak (committed by Karol Sobczak) a4b7e43 Adjust TestPushPartialAggregationThroughJoin to sprint branch
  • Karol Sobczak (committed by Karol Sobczak) ac06bde Add pcollections library dependency
  • Karol Sobczak (committed by Karol Sobczak) bf72dee Estimate anti join selectivity by least-selective equi-join-clause
  • Karol Sobczak (committed by Karol Sobczak) c017180 Use HashTreePMap in PlanNodeStatsEstimate to reduce map copying
  • Karol Sobczak (committed by Karol Sobczak) c061372 fixup! Add UnionStatsRule
  • Karol Sobczak (committed by Karol Sobczak) c5dd64b Add support for filter functions in partial aggregation pushdown
  • Karol Sobczak (committed by Karol Sobczak) d34fed1 Use stats calculator that is join aware in ReorderJoins
  • Karol Sobczak (committed by Karol Sobczak) ea75972 Remove legacy PartialAggregationPushDown optimizer
  • Karol Sobczak (committed by Łukasz Osipiuk) 99cfd1a fixup! Add JoinStatsRule to support equi-conditions and filters

Lukasz Osipiuk

  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 0624f1e Expose min/max value in statistics for Hive tables
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 1007f44 Allow using ComposableStatsCalculator instead of CoefficientBasedStatsCalculator
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 1332459 Add stats calculator unit testing framework
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 18d975d Add support for low/high values in StatisticsAssertion
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 194d017 Add scalar stats estimation for SymbolReference
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 1ad351c Remove explicit data_size from PlanNodeStatsEstimate
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 1e96d34 Add OutputStatsRule
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 21edd86 Add doc for statistics
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 27d7c13 Add TestTpchLocalStats stub
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 40b1f35 Add ProjectStatsRule (TODO)
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 4750ea4 Introduce range column statistics
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 59e3e07 Replace nulls count with nulls fraction in column statistics
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 63a3a40 Add ColumnStatistics.UNKNOWN_COLUMN_STATISTICS
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 950c7f7 Add support for LocalQueryRunner in StatisticsAssertion
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 9546667 Add PlanNodeStatisticsAssertion
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) 9af6ed2 Add rule for computing stats for ValuesNode
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) a56926b Add LimitStatsRule (TODO)
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) a605718 Implement ExchangeStatsRule for single source
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) a9dbeab Add ComposableStatsCalculator
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) b535bd8 Return low/high value in show stats
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) b886e45 Add TableScanStatsRule
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) d66506f Add OutputNode support to PlanBuilder
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) ef31d70 Allow passing external QueryRunner to RuleTester
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) f0af2e3 Add clearRanges to ColumnStatistics.builder()
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) f0e5e32 Add scalar stats estimation for Cast
  • Lukasz Osipiuk (committed by Andrzej Fiedukowicz) f51edee Support column stats in StatisticsAssertion
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 1c7f4da Add doc for multi-file hive buckets support
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 293a7cb Refactor OperatorContext::reserveMemory
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 327159b Add tests for bucketed table with empty partitions
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 4b62042 Allow multiple files per bucket in hive
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 6f29bc6 Cleanup TestMemoryPools test
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 74b7a8a Allow empty partitions for clustered hive tables
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 8f20152 Add revoking tests to TestMemoryPools
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) 9b42c26 Improve exception messages
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) af86e72 Introduce MemoryRevokingScheduler
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) c8362a4 Make memory pool aware of revocable allocation type
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) f20490f Add doc for Hive empty buckets support
  • Lukasz Osipiuk (committed by Łukasz Osipiuk) f3c9dec Introduce QueryContextVisitor
  • Nezih Yigitbasi (committed by Łukasz Osipiuk) 8206d70 Synchronize access to reservedBytes and reservedRevocableBytes
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 0ea1973 Remove Filter above TableScan logic from CoefficientBasedStatsCalculator
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 0fa14a9 Add SymbolStatsAssertion.isEqualTo
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 381648b Add precondition checks in PlanNodeStatsEstimate constructor
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 3d5a4b4 Use doubles instead Estimates in PlanNodeStatsEstimate
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 41db4f1 Rename cost concept to stats
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 51b4b2f Add test for symbol reference stats calculation
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) 9dd58ef Do not check symobol statistics in PlanStatsMatcher
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) b656222 Add EnforceSingleRowStatsRule (TODO)
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) bba4a5a Add ScalarStatsCalculator
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) c13d897 TestStatsCalculator tests new version of calculator
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) c67f15f Add scalar stats estimation for Literal
  • Łukasz Osipiuk (committed by Andrzej Fiedukowicz) d22925f Add Symbol statistics to PlanNodeStatsEstimate
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 0105c5b Estimate JOIN selectivity by most selective equi-join-clause
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 07bb5ee Do not plan INNER join with just filter function
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 1df231f fixup! Propagate unknown expression stats filtering from leaf
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 3d5136d fixup! Add JoinStatsRule to support equi-conditions and filters
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 3e8e937 Change TestReorderJoins.q7
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 426f81c Fix ExpressionNotEqualExpression filter stats rule
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 4d4cbd8 Cap distinctValuesCount in intersect in in FilterStatsCalculator
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 503b126 Rename joinConditionBuilder variable in PredicatePushDown
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 540887b Propagate unknown expression stats filtering from leaf
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) 8b85d86 Stats calculator for simple Filter+Project+SemiJoin
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) a2ebcb0 Add simple implementation of SemiJoinStatsRule
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) a729c14 Remove SATURATED_FLOOR_CAST real->[bigint,integer]
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) b34fe31 fixup! Add spill for join
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) b57a308 Remove SATURATED_FLOOR_CAST double->bigint
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) b78219c fixup! Estimate JOIN selectivity by most selective equi-join-clause
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) ce9a756 Remove SATURATED_FLOOR_CAST double->decimal
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) d7d6237 Add simple SemiJoinStatsCalculator
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) db32a14 Refactor checks for unchanges stats in TestComparisonStatsCalculator
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) e872b1e fixup! Add JoinStatsRule to support equi-conditions and filters
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) ea3d7b4 Fix planning issue of LEFT coin converted to INNER
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) eddc386 Assume unknown value domain for types not representable as double
  • Łukasz Osipiuk (committed by Łukasz Osipiuk) ff5ba80 fixup! Add Symbol statistics to PlanNodeStatsEstimate

Maciej Grzybek

  • Maciej Grzybek (committed by Maciej Grzybek) 0473089 Make PlanNodeStats Mergeable
  • Maciej Grzybek (committed by Maciej Grzybek) 1393b48 Add documentation for EXPLAIN ANALYZE VERBOSE mode
  • Maciej Grzybek (committed by Maciej Grzybek) 251a124 Fix handling ConnectorSession in FunctionInvoker
  • Maciej Grzybek (committed by Maciej Grzybek) 2f24d55 Unify ExpressionInterpreter.invoke and FunctionInvoker code
  • Maciej Grzybek (committed by Maciej Grzybek) 535912a Remove ExpressionInterpreter.invoke, use FunctionInvoker instead
  • Maciej Grzybek (committed by Maciej Grzybek) 5dacc91 Pass FunctionRegistry instead of whole Metadata
  • Maciej Grzybek (committed by Maciej Grzybek) 60a3899 Add additional statistics for WindowOperator
  • Maciej Grzybek (committed by Maciej Grzybek) 6d303be Add VERBOSE option to EXPLAIN
  • Maciej Grzybek (committed by Maciej Grzybek) 88b5bf3 Auto-reformat the WindowOperator
  • Maciej Grzybek (committed by Maciej Grzybek) 93bcc63 Change expected type to be Object instead of Block in Accumulator
  • Maciej Grzybek (committed by Maciej Grzybek) 95c6acf Move Mergeable to Util package
  • Maciej Grzybek (committed by Maciej Grzybek) a946c1c Fix formatting in AccumulatorCompiler
  • Maciej Grzybek (committed by Maciej Grzybek) cd1ba79 Fix IntelliJ warnings in ExpressionInterpreter
  • Maciej Grzybek (committed by Maciej Grzybek) e153912 Add information about rows distribution per task to explain analyze
  • Maciej Grzybek (committed by Maciej Grzybek) ea44575 Add missing test for explain analyze with type specified
  • Maciej Grzybek (committed by Maciej Grzybek) f493e85 Prevent providing WindowInfo before the operator finishes
  • Maciej Grzybek (committed by Maciej Grzybek) f574127 Make Mergeable capable of merging Optionals

Piotr Findeisen

  • Nezih Yigitbasi (committed by Piotr Findeisen) c8b6e01 Fix flaky test TestPrestoDriver::testQueryTimeout()
  • Piotr Findeisen (committed by Christina Wallin) 0119258 Fix calls to HostAndPort.getHostText
  • Piotr Findeisen (committed by Piotr Findeisen) 1e0f33c fixup! Introduce CostComparator
  • Piotr Findeisen (committed by Piotr Findeisen) 26595d1 fixup! Introduce CostCalculator interface
  • Piotr Findeisen (committed by Piotr Findeisen) 280203a Provide thread safe AggregatedMemoryContext implementation
  • Piotr Findeisen (committed by Piotr Findeisen) 2b2757b fixup! Add PartitioningSpiller
  • Piotr Findeisen (committed by Piotr Findeisen) 2c956e5 fixup! Add doc for statistics
  • Piotr Findeisen (committed by Piotr Findeisen) 318c856 fixup! Add PartitioningSpiller
  • Piotr Findeisen (committed by Piotr Findeisen) 3333120 Improve IN predicate statistics
  • Piotr Findeisen (committed by Piotr Findeisen) 39aa485 fixup! Add doc for statistics
  • Piotr Findeisen (committed by Piotr Findeisen) 3dacf66 fixup! Add FilterStatsCalculator
  • Piotr Findeisen (committed by Piotr Findeisen) 420201a fixup! Provide thread safe AggregatedMemoryContext implementation
  • Piotr Findeisen (committed by Piotr Findeisen) 459a3c9 Generalize symbol-to-symbol comparisons to cover expressions
  • Piotr Findeisen (committed by Piotr Findeisen) 4e14575 fixup! Add stats calculation for FilterNode comparisons
  • Piotr Findeisen (committed by Piotr Findeisen) 4f9b233 Cache LookupSource to avoid construction cost
  • Piotr Findeisen (committed by Piotr Findeisen) 5620f30 fixup! Use stats calculator that is join aware in ReorderJoins
  • Piotr Findeisen (committed by Piotr Findeisen) 5b7753e fixup! TMP TODO notes for my dear team ☺︎
  • Piotr Findeisen (committed by Piotr Findeisen) 64f2c62 fixup! Add FilterStatsCalculator
  • Piotr Findeisen (committed by Piotr Findeisen) 6d80d8f fixup! Add stats calculation for FilterNode comparisons
  • Piotr Findeisen (committed by Piotr Findeisen) 6fd45b9 fixup! Add Symbol statistics to PlanNodeStatsEstimate
  • Piotr Findeisen (committed by Piotr Findeisen) 700df66 fixup! Add Symbol statistics to PlanNodeStatsEstimate
  • Piotr Findeisen (committed by Piotr Findeisen) 714117a Add a TODO note to FileSingleStreamSpillerFactory
  • Piotr Findeisen (committed by Piotr Findeisen) 79aada6 Test combination of IN and cast literals
  • Piotr Findeisen (committed by Piotr Findeisen) 7f652a2 fixup! Add stats calculation for FilterNode comparisons
  • Piotr Findeisen (committed by Piotr Findeisen) 7f6c6af fixup! Add stats calculation for FilterNode comparisons
  • Piotr Findeisen (committed by Piotr Findeisen) 932263a fixup! Add stats calculation for FilterNode comparisons
  • Piotr Findeisen (committed by Piotr Findeisen) 936698d fixup! Rename cost concept to stats
  • Piotr Findeisen (committed by Piotr Findeisen) 97a42c3 fixup! Add Symbol statistics to PlanNodeStatsEstimate
  • Piotr Findeisen (committed by Piotr Findeisen) a63bdbb fixup! Add stats calculation for FilterNode comparisons
  • Piotr Findeisen (committed by Piotr Findeisen) ad1ba3f fixup! Add PartitioningSpiller
  • Piotr Findeisen (committed by Piotr Findeisen) adfc7ec fixup! Add stats calculation for FilterNode logical operations
  • Piotr Findeisen (committed by Piotr Findeisen) aebfe28 Merge PR #637 'statistics generalized comparisons' into release-0.179-t
  • Piotr Findeisen (committed by Piotr Findeisen) b8a32d7 Use generalized expression-to-expression comparison stats calculations
  • Piotr Findeisen (committed by Piotr Findeisen) b8c7059 Use generalized expression-to-literal comparison stats calculations
  • Piotr Findeisen (committed by Piotr Findeisen) bdf290c fixup! Add PartitioningSpiller
  • Piotr Findeisen (committed by Piotr Findeisen) bfcf228 Expose aggregation keys that are also source fields to ORDER BY
  • Piotr Findeisen (committed by Piotr Findeisen) c04e40e fixup! Introduce CostComparator
  • Piotr Findeisen (committed by Piotr Findeisen) c4c23aa fixup! Add stats calculation for FilterNode logical operations
  • Piotr Findeisen (committed by Piotr Findeisen) d49e611 fixup! Change StatsCalculator API to use Lookup for computing child costs
  • Piotr Findeisen (committed by Piotr Findeisen) e284933 fixup! Add PartitioningSpiller
  • Piotr Findeisen (committed by Piotr Findeisen) e717a74 Generalize symbol-to-literal comparisons to cover expressions
  • Piotr Findeisen (committed by Piotr Findeisen) eac8b1a Consider ranges overlap when stating symbols comparison
  • Piotr Findeisen (committed by Piotr Findeisen) edd6a6c FIXUP don't create new local Spill context when invoking spiller factory :/
  • Piotr Findeisen (committed by Piotr Findeisen) f871f28 fixup! Add stats calculation for FilterNode comparison related operators
  • Piotr Findeisen (committed by Rebecca Schlussel) 00245ff Fix date constant in test
  • Piotr Findeisen (committed by Rebecca Schlussel) 26baf61 Test stat calculations for DATE comparisons
  • Piotr Findeisen (committed by Rebecca Schlussel) 33670b9 Construct proper literal for DATE in LiteralInterpreter.toExpression
  • Piotr Findeisen (committed by Rebecca Schlussel) 7b0b4b3 Support DATE to double conversion in DomainConverter
  • Piotr Findeisen (committed by Łukasz Osipiuk) 05a8abc Ensure everything is closed in SpillableHashAggregationBuilder
  • Piotr Findeisen (committed by Łukasz Osipiuk) 07be20e Refactor LookupJoinOperator so that it doesn't keep LookupSource
  • Piotr Findeisen (committed by Łukasz Osipiuk) 0cac9cd Fix formatting
  • Piotr Findeisen (committed by Łukasz Osipiuk) 0fe3785 Add factory method for constructing dictionary pages
  • Piotr Findeisen (committed by Łukasz Osipiuk) 10adfb6 Trigger memory revoking quickly on over-allocation
  • Piotr Findeisen (committed by Łukasz Osipiuk) 11490b0 Rename PartitionFunction to BucketPartitionFunction
  • Piotr Findeisen (committed by Łukasz Osipiuk) 1769bda fixup! Calculate checksum of spilled LookupSource
  • Piotr Findeisen (committed by Łukasz Osipiuk) 21db4d3 Extract join operator factory creation in TestHashJoinOperator
  • Piotr Findeisen (committed by Łukasz Osipiuk) 2e6e457 [superseded by airlift/airlift#534] Add utility functions aiding Operator implementations
  • Piotr Findeisen (committed by Łukasz Osipiuk) 2f5c6d5 Add spill for join
  • Piotr Findeisen (committed by Łukasz Osipiuk) 306362e [spilled to prestodb#8263] Fix thread-safety of SpillContexts [...]
  • Piotr Findeisen (committed by Łukasz Osipiuk) 3bdc506 fixup! Cap distinct values count to type domain range length
  • Piotr Findeisen (committed by Łukasz Osipiuk) 3f2d318 Do not print configuration in tests to avoid Travis log limit
  • Piotr Findeisen (committed by Łukasz Osipiuk) 4557257 Change memory revoking finish to continuations
  • Piotr Findeisen (committed by Łukasz Osipiuk) 45c102b Keep calling build driver in TestHashJoinOperator after build is complete
  • Piotr Findeisen (committed by Łukasz Osipiuk) 47ed4c7 Add helper class for operating on Optionals
  • Piotr Findeisen (committed by Łukasz Osipiuk) 4cc0b06 fixup! Add spill for join
  • Piotr Findeisen (committed by Łukasz Osipiuk) 4dadbac Prevent file leak in FileSingleStreamSpiller
  • Piotr Findeisen (committed by Łukasz Osipiuk) 4f53cbd Lazy create partition spillers to avoid spilling locations skew
  • Piotr Findeisen (committed by Łukasz Osipiuk) 4f7d73e TODO markers
  • Piotr Findeisen (committed by Łukasz Osipiuk) 5a30760 TMP TODO notes for my dear team ☺︎
  • Piotr Findeisen (committed by Łukasz Osipiuk) 727510c [spilled to prestodb#8236] Move providing OuterPositionIterator to LookupSourceFactory ...
  • Piotr Findeisen (committed by Łukasz Osipiuk) 740e37a Extract TaskManagementExecutor from SqlTaskManager
  • Piotr Findeisen (committed by Łukasz Osipiuk) 7478eb9 squash! Add spill for join
  • Piotr Findeisen (committed by Łukasz Osipiuk) 7b77516 fixup! TMP test more
  • Piotr Findeisen (committed by Łukasz Osipiuk) 7b911e5 TMP test more
  • Piotr Findeisen (committed by Łukasz Osipiuk) 800f56e Update memory allocation when finishing in HashAggregationOperator
  • Piotr Findeisen (committed by Łukasz Osipiuk) 832781e Make LocalPartitionGenerator implement PartitionFunction
  • Piotr Findeisen (committed by Łukasz Osipiuk) 83c65ba fixup! Assume unknown value domain for types not representable as double
  • Piotr Findeisen (committed by Łukasz Osipiuk) 88c4a30 Refactor supplying partition to PartitionedLookupSourceFactory
  • Piotr Findeisen (committed by Łukasz Osipiuk) 8a31b19 Make PagesIndex.clear() free memory
  • Piotr Findeisen (committed by Łukasz Osipiuk) 905abd1 Reliably call Operator.finish() even if blocked
  • Piotr Findeisen (committed by Łukasz Osipiuk) 9452f3b fixup! Lazy create partition spillers to avoid spilling locations skew
  • Piotr Findeisen (committed by Łukasz Osipiuk) 9d7dcd9 Remove redundant parameter from DictionaryBlock constructor calls
  • Piotr Findeisen (committed by Łukasz Osipiuk) ba4c07a TODO marker
  • Piotr Findeisen (committed by Łukasz Osipiuk) bb5337c fixup! Change memory revoking finish to continuations
  • Piotr Findeisen (committed by Łukasz Osipiuk) c3f3924 fixup! Check checksum of LookupSourceSupplier in HashBuilderOperator
  • Piotr Findeisen (committed by Łukasz Osipiuk) cb1cf99 Change PartitionedLookupSourceFactory synchronization to RW-lock
  • Piotr Findeisen (committed by Łukasz Osipiuk) d0f4973 Add minimal DictionaryBlock constructor
  • Piotr Findeisen (committed by Łukasz Osipiuk) d1522e0 Separate build side setup and execution in TestHashJoinOperator
  • Piotr Findeisen (committed by Łukasz Osipiuk) d20d691 Reorder methods in HashBuildOperator to match lifecycle
  • Piotr Findeisen (committed by Łukasz Osipiuk) e689f6f Fix PagesIndex.getEstimatedSize for empty index
  • Piotr Findeisen (committed by Łukasz Osipiuk) e68bf84 Try index compaction before spilling in HashBuilderOperator
  • Piotr Findeisen (committed by Łukasz Osipiuk) f612f51 fixup! Consider ranges overlap when stating symbols comparison
  • Piotr Findeisen (committed by Łukasz Osipiuk) f7014ba Refactor JoinProbe so that it doesn't keep LookupSource

Piotr Nowojski

  • Piotr Nowojski (committed by Andrzej Fiedukowicz) 56ac3a9 Estimate default outputSizeInBytes based on outputRows
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) 8c7e641 Print cost estimate in Explain
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) a177256 Add information about most important performance releated properties
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) aa11993 Introduce CostComparator
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) bd5084c Add tuning for queries with fast processing splits
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) c85d7e8 Refactor getQueryMaxMemory session property getter
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) ecf8540 Add spilling properties documentation
  • Piotr Nowojski (committed by Andrzej Fiedukowicz) f12281c Introduce CostCalculator interface
  • Piotr Nowojski (committed by Rebecca Schlussel) dcb412b Document join reordering
  • Piotr Nowojski (committed by Łukasz Osipiuk) 02f0ed5 Always revoke memory in TestHashAggregationOperator
  • Piotr Nowojski (committed by Łukasz Osipiuk) 2e09de6 Revocable system memory in contexts
  • Piotr Nowojski (committed by Łukasz Osipiuk) 5094f5d Configure spill space limits in LocalQueryRunner
  • Piotr Nowojski (committed by Łukasz Osipiuk) 5a15eea Handle operator memory revoking in Driver
  • Piotr Nowojski (committed by Łukasz Osipiuk) 622a52c Ensure constant number of LookupJoinOperators
  • Piotr Nowojski (committed by Łukasz Osipiuk) 75ec07f Add custom TaskContext builder
  • Piotr Nowojski (committed by Łukasz Osipiuk) 849218c Trigger memory revoking in OperatorAssertions
  • Piotr Nowojski (committed by Łukasz Osipiuk) 867a885 Pass spilling parameters to HashBuildOperator & LookupJoinOperator
  • Piotr Nowojski (committed by Łukasz Osipiuk) 983261e Always revoke memory in TestQuerySpillLimits
  • Piotr Nowojski (committed by Łukasz Osipiuk) 9c79bae Use revocable memory for spilling in aggregation
  • Piotr Nowojski (committed by Łukasz Osipiuk) a9657dc Add getPages method to PagesIndex
  • Piotr Nowojski (committed by Łukasz Osipiuk) e970b6e Always revoke memory in TestLocalBinarySpilledQueries
  • Piotr Nowojski (committed by Łukasz Osipiuk) ee86bc3 Rename memoryLimitBeforeSpill config/session property
  • Piotr Nowojski (committed by Łukasz Osipiuk) f14778c Add PartitioningSpiller

Rebecca Schlussel

  • Rebecca Schlussel (committed by Andrzej Fiedukowicz) 08e768d Add ATQ test for table constraints infinite loop bug
  • Rebecca Schlussel (committed by Andrzej Fiedukowicz) 3973aad Add rule to push down table constraints
  • Rebecca Schlussel (committed by Andrzej Fiedukowicz) 76c39c5 Change StatsCalculator API to use Lookup for computing child costs
  • Rebecca Schlussel (committed by Anton Petrov) 636a5ee Add a test for the TPC-DS connector
  • Rebecca Schlussel (committed by Anton Petrov) fdba460 Add the data generation portion of the TPC-DS connector
  • Rebecca Schlussel (committed by Rebecca Schlussel) 00c9133 Revert "Quarantine TestReorderJoins tests depending on cost calculation details"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 1a8d656 Fixup! Introduce CostComparator
  • Rebecca Schlussel (committed by Rebecca Schlussel) 2209bbb Special case NOT(IS NULL) predicate stats
  • Rebecca Schlussel (committed by Rebecca Schlussel) 23021c1 Add docs for query performance analysis
  • Rebecca Schlussel (committed by Rebecca Schlussel) 232bd27 Fixup! Add stats calculation for filter range comparisons
  • Rebecca Schlussel (committed by Rebecca Schlussel) 239b6b4 Add tests for s3 connector
  • Rebecca Schlussel (committed by Rebecca Schlussel) 2969319 Replace reorder_joins with join_reordering_strategy
  • Rebecca Schlussel (committed by Rebecca Schlussel) 2b36a8b Support using a fake node count for unit tests
  • Rebecca Schlussel (committed by Rebecca Schlussel) 3b4e2b3 Add ReorderJoins rule to pick the best join order
  • Rebecca Schlussel (committed by Rebecca Schlussel) 41e5fcc Add benchmark for ReorderJoins rule
  • Rebecca Schlussel (committed by Rebecca Schlussel) 4801614 Replace distributed_joins with join_distribution_type session property
  • Rebecca Schlussel (committed by Rebecca Schlussel) 4f1f05d Add unit testing infra for TableWriter and Union
  • Rebecca Schlussel (committed by Rebecca Schlussel) 539178b Revert "Test that CostCalculator successfully returns unknown costs for no stats"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 671afb2 Add back improvements to fragment PlanPrinter
  • Rebecca Schlussel (committed by Rebecca Schlussel) 67e24c2 Support matching join distribution type in tests
  • Rebecca Schlussel (committed by Rebecca Schlussel) 682fed5 Document the push_aggregation_through_join property
  • Rebecca Schlussel (committed by Rebecca Schlussel) 6c1515d Revert "fixup! Introduce CostCalculator interface"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 783c67e Revert "Make unknown costs the default in PlanNodeCostEstimate.Builder"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 797c72f Revert "Add TestCostCalculator.CostAssertionBuilder#cpu/network/memoryUnknown()"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 84d7d47 Support inserting stats for plan unit tests
  • Rebecca Schlussel (committed by Rebecca Schlussel) 89ccda3 Don't sum distinct values counts in UnionStatsRule
  • Rebecca Schlussel (committed by Rebecca Schlussel) 8a871d0 Revert "fixup! Introduce CostCalculator interface"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 93f0951 Revert "fixup! Add ReorderJoins rule to pick the best join order"
  • Rebecca Schlussel (committed by Rebecca Schlussel) 9a75c16 Migrate PushTableWriteThroughUnion
  • Rebecca Schlussel (committed by Rebecca Schlussel) b834efb Revert "fixup! Remove explicit data_size from PlanNodeStatsEstimate"
  • Rebecca Schlussel (committed by Rebecca Schlussel) bce3573 Make binaryExpression() handle empty list
  • Rebecca Schlussel (committed by Rebecca Schlussel) bd2e5e2 Revert "fixup! Introduce CostComparator"
  • Rebecca Schlussel (committed by Rebecca Schlussel) c9b6e76 Add documentation for prepared statements
  • Rebecca Schlussel (committed by Rebecca Schlussel) cc770a1 Cap aggregation row count at input row count
  • Rebecca Schlussel (committed by Rebecca Schlussel) d1b701a Support passing statsCalculator to RuleAssert
  • Rebecca Schlussel (committed by Rebecca Schlussel) dd980ed Revert "fixup! Introduce CostCalculator interface"
  • Rebecca Schlussel (committed by Rebecca Schlussel) e0fb514 Rewrite join filters in SimplifyExpressions
  • Rebecca Schlussel (committed by Rebecca Schlussel) fdb77ee Add methods to flip join and set distribution type

Sanjay Sharma

  • Sanjay Sharma (committed by Sanjay Sharma) 0676c09 Update docker images version to 20
  • Sanjay Sharma (committed by Sanjay Sharma) 842e314 Update the version of hadoop-apache2
  • Sanjay Sharma (committed by Sanjay Sharma) 8d9bd25 Fix failing test for SQL Server
  • Sanjay Sharma (committed by Sanjay Sharma) 993095a Fix error message with invalid catalog and schema
  • Sanjay Sharma (committed by Sanjay Sharma) b06e036 Add configuration variables for authenticating to Azure Blob

Szymon Matejczyk

  • Szymon Matejczyk (committed by Łukasz Osipiuk) 50f1f4d Extend TestReorderJoins with all TPC-H queries

Wojciech Biela

  • Wojciech Biela (committed by Wojciech Biela) 4ae6be6 Add spilled data size to EXPLAIN ANALYZE
  • Wojciech Biela (committed by Wojciech Biela) 6b176a9 Add spilled data size to query summary debug
  • Wojciech Biela (committed by Wojciech Biela) aa29fe1 Add spilled data size to Web UI
  • Wojciech Biela (committed by Wojciech Biela) d0bcf4b Add spilled data size to QueryStats

Support INSERT for Cassandra connector

Dear team

We are trying to add support for INSERT to Cassandra connector,
because we have use cases which select data from HDFS and then insert into Cassandra.
Hope you can help us to complete this issue.

Timestamp time zone handling appears broken in ORC format for 0.167-t.0.2

Teradata Presto release appears to be deserializing the timestamp column incorrectly in ORC files with time zone information. (ORC written from Hive 1.2.1)

Note the query returns correctly in standard Presto and in Hive. The table was built via Hive, transforming the first column, a string, into a timestamp column; the local time zone of Hive/Presto is America/Los_Angeles.

Teradata Presto 0.167-t.0.2:
time         |           tm            |                    _col2
---------------------+-------------------------+---------------------------------------------
 01-30-2016-00:11:02 | 2016-01-30 08:11:02.000 | 2016-01-30 07:11:02.000 America/Los_Angeles
 01-30-2016-00:39:28 | 2016-01-30 08:39:28.000 | 2016-01-30 07:39:28.000 America/Los_Angeles


Mini cluster (0.17x)
time         |           tm            |                    _col2
---------------------+-------------------------+---------------------------------------------
 01-30-2016-00:11:02 | 2016-01-30 00:11:02.000 | 2016-01-30 00:11:02.000 America/Los_Angeles
 01-30-2016-00:39:28 | 2016-01-30 00:39:28.000 | 2016-01-30 00:39:28.000 America/Los_Angeles

presto:mesoads> select time,tm, tm at time zone 'America/Los_Angeles' from orc_table 

SQL push-down with VARCHAR, DATE, TIME and TIMESTAMP types

  1. Teradata 152t version
    Limited SQL push-down

Presto does not “push-down” aggregate calculations to PostgreSQL or MySQL
http://teradata.github.io/presto/docs/current/release/release-0.152.1-t.html

  1. FB 0.148 later version
    https://prestodb.io/docs/current/release/release-0.148.html

MySQL and PostgreSQL Changes : Push down predicates for VARCHAR, DATE, TIME and TIMESTAMP types.

Hive : Push down filters for columns of type DECIMAL.

Our question is:
In my opinion, Teradata 152t should be rebased with FB 0.152 and with Teradata enhanced patches.
however in 152t we can't find new push downs in MySQL, is Teradata 152t document old or 152t didn't merged the MySQL VARCHAR push down patches?

Is this a bug about get query peak memory usage ?

Dear team

We are considering to get query peak memory usage.
I tried to run a query with Hive connector.
Coordinator UI
However found Peak Memory always be 0B.

In google user groups , it is also pointed out that "Presto coordinator UI shows you the peak memory usage of a query and current memory usage of a query".

Is this a bug in displaying Peak Memory?

Commits to backport to 0.179-t

List of back ported changes for 0.180

  • Skip unknown costs in EXPLAIN output. Get confirmation from @findepi that this is on our release branch. Already on release branch.
  • Fix incorrect results when performing comparisons between values of approximate data types (REAL, DOUBLE) and columns of certain exact numeric types (INTEGER, BIGINT, DECIMAL). Get confirmation from @losipiuk that this is on our release branch. Merged to release branch.
  • Ignore partition bucketing if table is not bucketed. This allows dropping the bucketing from table metadata but leaving it for old partitions. TODO: add proper testing.

Nice to haves from 0.180

Release notes: https://prestodb.io/docs/current/release/release-0.180.html
Detailed release notes: prestodb#8358

List of back ported changes for 0.181

  • Fix flaky level multipliers test by increasing tolerances. Relies on code not on the release branch. Will not backport.
  • Fix flaky test (TestQueues::testTooManyQueries). Merged to release branch.
  • Fix query failure when ORDER BY expressions reference columns that are used in the GROUP BY clause by their fully-qualified name. Confirm with @sopel39 that this is on our release branch. Merged to release branch.
  • Fix potential native memory leak when writing tables using RCFile. Merged to release branch.
  • Fix regression that broke serialization of SchemaTableName. Merged to release branch.

Nice to haves from 0.181

  • Improve performance of count(*) aggregations over subqueries with known constant cardinality. Talk to @kokosing and discuss potential impact of porting this. @kokosing does not think this should be back ported.

Release notes: https://prestodb.io/docs/current/release/release-0.181.html
Detailed release notes: prestodb#8485

List of back ported changes for 0.182

  • Fix the EXPLAIN query plan for tables that are partitioned on TIMESTAMP or DATE columns. Confirm with @maciejgrzybek that this is on our release branch. Already on release branch.
  • Fix an issue that would sometimes prevent queries from being cancelled when exiting from the pager. Talk to @arhimondr and confirm this got ported. Merged to release branch.

Nice to haves from 0.182

  • Fix correctness issue that causes corr() to return positive numbers for inverse correlations.

Release notes: https://prestodb.io/docs/current/release/release-0.182.html
Detailed release notes: prestodb#8622

"hive"."information_schema"."columns" with 'like' hangs

We, at LendingClub, using Presto "0.157.1.t.0.5-1.x86_64" and we access Presto with JDBC in SQLWorkBench. while we are trying to list Columns in any database in "Database Explorer" , most of the time it hangs.

We saw in Discovery URI the query its trying is

SELECT * FROM "hive"."information_schema"."columns" WHERE table_schema like '<schema_name>' AND table_name LIKE '<table_name>' AND column_name LIKE '%'

In this scenario, most of the time this hangs .

We also reproduced the issue by smaller query

SELECT * FROM "hive"."information_schema"."columns" WHERE table_schema like '<schema_name>' hangs

But

SELECT * FROM "hive"."information_schema"."columns" WHERE table_schema = '<schema_name>'

send result immediately

Check if second run of rule does not fire in RuleTester

It seems natural that optimizer Rule if run on output of the same Rule should not fire again.
While we may think of rules that do only part of work on first run, and some continuation of work on the second run, it does not seem like necessary feature.

If we agree that we do not need that, let's add a check to RuleTest verifying that:

rule.apply(rule.apply(plan)) == Optional.empty()

Unable to set system access control

Hi,

I have some problem setting custom system access control at Teradata's version of presto (0.152.1-T.0.2). I wrote a plugin that only allows users to use kerberos principal's username as their username (I named it custom-access-control). It works as expected on prestodb/presto version of persto (0.152), but with Teradata's version it uses allow-all policy, instead of custom-access-control.

I created /etc/presto/access-control.properties file on the coordinator and copy it to the workers as well. The content of the file is: access-control.name=custom-access-control

I also tried to put access-control.properties into /etc/opt/prestoadmin/coordinator and into /etc/opt/prestoadmin/worker folders and redeploy the configuration with presto-admin.

According to the log file the plugin loaded and registered successfully.

2016-11-24T06:59:08.524-0500	INFO	main	com.facebook.presto.server.PluginManager	-- Loading plugin /usr/lib/presto/lib/plugin/custom_access_control --
2016-11-24T06:59:08.525-0500	INFO	main	com.facebook.presto.server.PluginManager	Installing com.ge.presto.custom.access.control.CustomSystemAccessControlPlugin
2016-11-24T06:59:08.525-0500	INFO	main	com.facebook.presto.server.PluginManager	Registering system access control custom-access-control
2016-11-24T06:59:08.525-0500	INFO	main	com.facebook.presto.server.PluginManager	-- Finished loading plugin /usr/lib/presto/lib/plugin/custom_access_control --

But it uses allow-all system access control:

2016-11-24T06:59:11.374-0500	INFO	main	com.facebook.presto.security.AccessControlManager	-- Loading system access control --
2016-11-24T06:59:11.374-0500	INFO	main	com.facebook.presto.security.AccessControlManager	-- Loaded system access control allow-all --

With prestodb/presto version, it looks like the following:

2016-11-24T06:53:44.955-0500	INFO	main	com.facebook.presto.security.AccessControlManager	-- Loading system access control --
2016-11-24T06:53:44.956-0500	INFO	main	com.facebook.presto.security.AccessControlManager	-- Loaded system access control custom-access-control --

Did I miss some other config settings that is required? Thanks for your help.

Persto 0.157.1 in EMR is facing issues regarding client side encryption AWS KMS Master Key

I am getting NULL values in all the rows in result set in presto client in using s3 partitions.

  1. Data is stored in the S3 in the encripted form(client side encription) using AWS KMS master key.
  2. EMR Hive is able to decrypt the S3 data on select query.
  3. We have done below listed settings in the Presto.hive.properties but it is returning null values(screenshot is attached) against all the rows on making select [query.]

hive properties
prestoqueryresult

Your help will be really appreciable.
Gaurav

Describe works, but select does not

I have set up two virtual machines. One with Teradata presto 203 (on redhat linux 7), and other with HDP 2.6 from the Kylo VM. When I run describe from presto on a managed table, it works fine. But select gives me an error. Please see the below log :

presto> use hive.toy_store;
USE
presto:toy_store> describe products;
Column | Type | Extra | Comment
--------------------+---------+-------+---------
productcode | varchar | |
productname | varchar | |
productline | varchar | |
productscale | varchar | |
productvendor | varchar | |
productdescription | varchar | |
quantityinstock | varchar | |
buyprice | double | |
msrp | double | |
processing_dttm | varchar | |
(10 rows)

Query 20180904_150104_00021_zhvt9, FINISHED, 1 node
Splits: 19 total, 19 done (100.00%)
0:01 [10 rows, 746B] [13 rows/s, 998B/s]

presto:toy_store> select count(*) from products;

Query 20180904_150115_00022_zhvt9, FAILED, 1 node
Splits: 17 total, 0 done (0.00%)
0:00 [0 rows, 0B] [0 rows/s, 0B/s]

Query 20180904_150115_00022_zhvt9 failed: Failed to list directory: hdfs://sandbox.kylo.io:8020/app/warehouse/toy_store/products


I have checked all permissions, modified the users - but none of them worked. I added the hive config properties as well to see if it works, but it did not.

Please suggest if I have missed on any properties to tap into.

Enable using customized endpoint for S3-Compatiable storage

Dear team,

The current Presto hive connector can't set customized endpoint for S3-Compatiable storage.
We are using S3-Compatiable storage, so we need to set the endpoint to our internal server.

I will send a PR about this enhancement.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.