Select your cookie preferences

We use cookies and similar tools to enhance your experience, provide our services, deliver relevant advertising, and make improvements. Approved third parties also use these tools to help us deliver advertising and provide certain site features.

get_column_statistics_for_partition

get_column_statistics_for_partition(**kwargs)

Retrieves partition statistics of columns.

The Identity and Access Management (IAM) permission required for this operation is GetPartition .

See also: AWS API Documentation

Request Syntax

response = client.get_column_statistics_for_partition(
    CatalogId='string',
    DatabaseName='string',
    TableName='string',
    PartitionValues=[
        'string',
    ],
    ColumnNames=[
        'string',
    ]
)
Parameters
  • CatalogId (string) -- The ID of the Data Catalog where the partitions in question reside. If none is supplied, the Amazon Web Services account ID is used by default.
  • DatabaseName (string) --

    [REQUIRED]

    The name of the catalog database where the partitions reside.

  • TableName (string) --

    [REQUIRED]

    The name of the partitions' table.

  • PartitionValues (list) --

    [REQUIRED]

    A list of partition values identifying the partition.

    • (string) --
  • ColumnNames (list) --

    [REQUIRED]

    A list of the column names.

    • (string) --
Return type

dict

Returns

Response Syntax

{
    'ColumnStatisticsList': [
        {
            'ColumnName': 'string',
            'ColumnType': 'string',
            'AnalyzedTime': datetime(2015, 1, 1),
            'StatisticsData': {
                'Type': 'BOOLEAN'|'DATE'|'DECIMAL'|'DOUBLE'|'LONG'|'STRING'|'BINARY',
                'BooleanColumnStatisticsData': {
                    'NumberOfTrues': 123,
                    'NumberOfFalses': 123,
                    'NumberOfNulls': 123
                },
                'DateColumnStatisticsData': {
                    'MinimumValue': datetime(2015, 1, 1),
                    'MaximumValue': datetime(2015, 1, 1),
                    'NumberOfNulls': 123,
                    'NumberOfDistinctValues': 123
                },
                'DecimalColumnStatisticsData': {
                    'MinimumValue': {
                        'UnscaledValue': b'bytes',
                        'Scale': 123
                    },
                    'MaximumValue': {
                        'UnscaledValue': b'bytes',
                        'Scale': 123
                    },
                    'NumberOfNulls': 123,
                    'NumberOfDistinctValues': 123
                },
                'DoubleColumnStatisticsData': {
                    'MinimumValue': 123.0,
                    'MaximumValue': 123.0,
                    'NumberOfNulls': 123,
                    'NumberOfDistinctValues': 123
                },
                'LongColumnStatisticsData': {
                    'MinimumValue': 123,
                    'MaximumValue': 123,
                    'NumberOfNulls': 123,
                    'NumberOfDistinctValues': 123
                },
                'StringColumnStatisticsData': {
                    'MaximumLength': 123,
                    'AverageLength': 123.0,
                    'NumberOfNulls': 123,
                    'NumberOfDistinctValues': 123
                },
                'BinaryColumnStatisticsData': {
                    'MaximumLength': 123,
                    'AverageLength': 123.0,
                    'NumberOfNulls': 123
                }
            }
        },
    ],
    'Errors': [
        {
            'ColumnName': 'string',
            'Error': {
                'ErrorCode': 'string',
                'ErrorMessage': 'string'
            }
        },
    ]
}

Response Structure

  • (dict) --

    • ColumnStatisticsList (list) --

      List of ColumnStatistics that failed to be retrieved.

      • (dict) --

        Represents the generated column-level statistics for a table or partition.

        • ColumnName (string) --

          Name of column which statistics belong to.

        • ColumnType (string) --

          The data type of the column.

        • AnalyzedTime (datetime) --

          The timestamp of when column statistics were generated.

        • StatisticsData (dict) --

          A ColumnStatisticData object that contains the statistics data values.

          • Type (string) --

            The type of column statistics data.

          • BooleanColumnStatisticsData (dict) --

            Boolean column statistics data.

            • NumberOfTrues (integer) --

              The number of true values in the column.

            • NumberOfFalses (integer) --

              The number of false values in the column.

            • NumberOfNulls (integer) --

              The number of null values in the column.

          • DateColumnStatisticsData (dict) --

            Date column statistics data.

            • MinimumValue (datetime) --

              The lowest value in the column.

            • MaximumValue (datetime) --

              The highest value in the column.

            • NumberOfNulls (integer) --

              The number of null values in the column.

            • NumberOfDistinctValues (integer) --

              The number of distinct values in a column.

          • DecimalColumnStatisticsData (dict) --

            Decimal column statistics data.

            • MinimumValue (dict) --

              The lowest value in the column.

              • UnscaledValue (bytes) --

                The unscaled numeric value.

              • Scale (integer) --

                The scale that determines where the decimal point falls in the unscaled value.

            • MaximumValue (dict) --

              The highest value in the column.

              • UnscaledValue (bytes) --

                The unscaled numeric value.

              • Scale (integer) --

                The scale that determines where the decimal point falls in the unscaled value.

            • NumberOfNulls (integer) --

              The number of null values in the column.

            • NumberOfDistinctValues (integer) --

              The number of distinct values in a column.

          • DoubleColumnStatisticsData (dict) --

            Double column statistics data.

            • MinimumValue (float) --

              The lowest value in the column.

            • MaximumValue (float) --

              The highest value in the column.

            • NumberOfNulls (integer) --

              The number of null values in the column.

            • NumberOfDistinctValues (integer) --

              The number of distinct values in a column.

          • LongColumnStatisticsData (dict) --

            Long column statistics data.

            • MinimumValue (integer) --

              The lowest value in the column.

            • MaximumValue (integer) --

              The highest value in the column.

            • NumberOfNulls (integer) --

              The number of null values in the column.

            • NumberOfDistinctValues (integer) --

              The number of distinct values in a column.

          • StringColumnStatisticsData (dict) --

            String column statistics data.

            • MaximumLength (integer) --

              The size of the longest string in the column.

            • AverageLength (float) --

              The average string length in the column.

            • NumberOfNulls (integer) --

              The number of null values in the column.

            • NumberOfDistinctValues (integer) --

              The number of distinct values in a column.

          • BinaryColumnStatisticsData (dict) --

            Binary column statistics data.

            • MaximumLength (integer) --

              The size of the longest bit sequence in the column.

            • AverageLength (float) --

              The average bit sequence length in the column.

            • NumberOfNulls (integer) --

              The number of null values in the column.

    • Errors (list) --

      Error occurred during retrieving column statistics data.

      • (dict) --

        Encapsulates a column name that failed and the reason for failure.

        • ColumnName (string) --

          The name of the column that failed.

        • Error (dict) --

          An error message with the reason for the failure of an operation.

          • ErrorCode (string) --

            The code associated with this error.

          • ErrorMessage (string) --

            A message describing the error.

Exceptions

  • Glue.Client.exceptions.EntityNotFoundException
  • Glue.Client.exceptions.InvalidInputException
  • Glue.Client.exceptions.InternalServiceException
  • Glue.Client.exceptions.OperationTimeoutException
  • Glue.Client.exceptions.GlueEncryptionException