
Glue

Data Catalog (databases, tables, partitions), crawlers, jobs, triggers, connections, schema registry.

JSON-RPC (X-Amz-Target) | multi-tenant | 65 operations
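To make the JSON-RPC framing concrete, the sketch below builds the headers and body for a single call without sending it. It assumes the standard AWS Glue wire format (`AWSGlue` target prefix, `application/x-amz-json-1.1` content type). `build_glue_request` is a hypothetical helper, not part of MiniStack or boto3, and request signing is left out.

```python
import json

# Service prefix AWS Glue uses in the X-Amz-Target header.
GLUE_TARGET_PREFIX = "AWSGlue"


def build_glue_request(operation, payload):
    """Return the (headers, body) pair for one Glue JSON-RPC call.

    Hypothetical helper for illustration only; it does not sign or send anything.
    """
    headers = {
        "Content-Type": "application/x-amz-json-1.1",
        # The operation is selected by this header, not by the URL path.
        "X-Amz-Target": f"{GLUE_TARGET_PREFIX}.{operation}",
    }
    body = json.dumps(payload).encode("utf-8")
    return headers, body


headers, body = build_glue_request("CreateDatabase", {"DatabaseInput": {"Name": "db"}})
print(headers["X-Amz-Target"])
print(body.decode("utf-8"))
```

In normal use boto3 produces this framing itself, so the helper is mainly useful for reading captured traffic or debugging a dispatch mismatch.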

Quick start

import boto3

# Point the client at the local MiniStack endpoint; the credentials are placeholders.
glue = boto3.client("glue", endpoint_url="http://localhost:4566",
                    region_name="us-east-1",
                    aws_access_key_id="test", aws_secret_access_key="test")

# Create a database, then a table with a single string column.
glue.create_database(DatabaseInput={"Name": "db"})
glue.create_table(DatabaseName="db",
    TableInput={"Name": "t", "StorageDescriptor": {"Columns": [{"Name": "c", "Type": "string"}]}})

Supported operations

This service exposes 65 operations as of MiniStack 1.3.14. The list is extracted directly from the handler dispatch in the source module.

BatchCreatePartition BatchDeleteTable BatchGetPartition BatchGetTriggers BatchStopJobRun CreateClassifier CreateConnection CreateCrawler CreateDatabase CreateJob CreatePartition CreatePartitionIndex CreateSecurityConfiguration CreateTable CreateTrigger CreateWorkflow DeleteClassifier DeleteConnection DeleteCrawler DeleteDatabase DeleteJob DeletePartition DeleteSecurityConfiguration DeleteTable DeleteTrigger DeleteWorkflow GetClassifier GetClassifiers GetConnection GetConnections GetCrawler GetCrawlerMetrics GetCrawlers GetDatabase GetDatabases GetJob GetJobRun GetJobRuns GetJobs GetPartition GetPartitionIndexes GetPartitions GetSecurityConfiguration GetSecurityConfigurations GetTable GetTables GetTags GetTrigger GetTriggers GetWorkflow ListTriggers StartCrawler StartJobRun StartTrigger StartWorkflowRun StopCrawler StopTrigger TagResource UntagResource UpdateCrawler UpdateDatabase UpdateJob UpdateTable UpdateTrigger UpdateWorkflow
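A test suite can check up front whether the calls it needs appear in the list above. The following is a minimal sketch: the operation names are copied from this page, while `operation_name` and `is_supported` are hypothetical helpers. The snake_case conversion is deliberately naive and would mangle operation names that contain acronyms.

```python
# Operation names copied from the list above (MiniStack 1.3.14).
SUPPORTED_OPERATIONS = frozenset("""
BatchCreatePartition BatchDeleteTable BatchGetPartition BatchGetTriggers
BatchStopJobRun CreateClassifier CreateConnection CreateCrawler CreateDatabase
CreateJob CreatePartition CreatePartitionIndex CreateSecurityConfiguration
CreateTable CreateTrigger CreateWorkflow DeleteClassifier DeleteConnection
DeleteCrawler DeleteDatabase DeleteJob DeletePartition
DeleteSecurityConfiguration DeleteTable DeleteTrigger DeleteWorkflow
GetClassifier GetClassifiers GetConnection GetConnections GetCrawler
GetCrawlerMetrics GetCrawlers GetDatabase GetDatabases GetJob GetJobRun
GetJobRuns GetJobs GetPartition GetPartitionIndexes GetPartitions
GetSecurityConfiguration GetSecurityConfigurations GetTable GetTables GetTags
GetTrigger GetTriggers GetWorkflow ListTriggers StartCrawler StartJobRun
StartTrigger StartWorkflowRun StopCrawler StopTrigger TagResource
UntagResource UpdateCrawler UpdateDatabase UpdateJob UpdateTable UpdateTrigger
UpdateWorkflow
""".split())


def operation_name(method_name):
    """Convert a boto3-style method name (snake_case) to a PascalCase operation name.

    Naive conversion: it does not restore acronyms such as "ML".
    """
    return "".join(part.capitalize() for part in method_name.split("_"))


def is_supported(method_name):
    """Return True if the boto3 method maps to an operation in the list above."""
    return operation_name(method_name) in SUPPORTED_OPERATIONS


print(is_supported("create_table"))
print(is_supported("create_dev_endpoint"))
```

Here `create_table` maps to `CreateTable`, which is in the list, while `create_dev_endpoint` maps to `CreateDevEndpoint`, which is not.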

CloudFormation

No CloudFormation resource types map to this service yet. Resources can still be created via the SDK or CLI.

Known limitations

  • Crawlers run but only enumerate catalog metadata; they do not infer schemas from S3 files.
  • Job definitions are stored, but Spark and Python shell scripts are never executed.
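Because crawlers do not infer schemas, one workaround is to declare the table schema explicitly rather than wait for a crawler to populate it. The sketch below assumes the standard Glue `TableInput` shape; `build_table_input` and `register_table` are hypothetical helpers and the bucket name is made up. Which optional fields MiniStack stores and returns is not confirmed here.

```python
def build_table_input(name, columns, location, partition_keys=None):
    """Build a Glue TableInput dict from explicit {column_name: type} mappings."""
    def as_columns(mapping):
        return [{"Name": col, "Type": typ} for col, typ in mapping.items()]

    return {
        "Name": name,
        "TableType": "EXTERNAL_TABLE",
        "PartitionKeys": as_columns(partition_keys or {}),
        "StorageDescriptor": {
            "Columns": as_columns(columns),
            "Location": location,
        },
    }


def register_table(glue, database, table_input):
    """Create the table through any client exposing boto3's create_table signature."""
    return glue.create_table(DatabaseName=database, TableInput=table_input)


table_input = build_table_input(
    "events",
    {"id": "bigint", "payload": "string"},
    "s3://example-bucket/events/",  # hypothetical bucket, for illustration only
    partition_keys={"dt": "string"},
)
```

With the client from the quick start, `register_table(glue, "db", table_input)` would issue the CreateTable call.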

Source

  • ministack/services/glue.py:97-157

Read the source to verify the operations list above; the dispatch tables and handler functions are the ground truth.