Harden database commands against failures #7369

onurctirtir · 2023-11-29T09:03:28Z

Today, for database commands that cannot be executed in a transaction block (**), we don't do much next time when the command is invoked, if the prior invocation failed in the midway, e.g., if CREATE DATABASE succeeded on some nodes but not on all. We should do something better than this. For example, @thanodnl shared this as an idea on how we could handle failure scenarios for CREATE DATABASE better:

eg, could we use a trick where we create the database with a temporary name on all nodes and then transactionally rename the database
we could track the temporary names in a local catalog table for cleanup later if required etc.

CREATE DATABASE ✅ Add failure handling for CREATE DATABASE commands #7483
DROP DATABASE
ALTER DATABASE SET TABLESPACE

In preprocess phase, we save the original database name, replace dbname field of CreatedbStmt with a temporary name (to let Postgres to create the database with the temporary name locally) and then we insert a cleanup record for the temporary database name on all nodes **(\*\*)**. And in postprocess phase, we first rename the temporary database back to its original name for local node and then return a list of distributed DDL jobs i) to create the database with the temporary name and then ii) to rename it back to its original name on other nodes. That way, if CREATE DATABASE fails on any of the nodes, the temporary database will be cleaned up by the cleanup records that we inserted in preprocess phase and in case of a failure, we won't leak any databases called as the name that user intended to use for the database. Solves the problem documented in #7369 for CREATE DATABASE commands. **(\*\*):** To ensure that we insert cleanup records on all nodes, with this PR we also start requiring having the coordinator in the metadata because otherwise we would skip inserting a cleanup record for the coordinator.

onurctirtir added usability DDL labels Nov 29, 2023

onurctirtir added this to the 12.3 Release milestone Nov 30, 2023

onurctirtir added the bug label Nov 30, 2023

onurctirtir self-assigned this Nov 30, 2023

onurctirtir mentioned this issue Feb 15, 2024

Add failure handling for CREATE DATABASE commands #7483

Merged

naisila modified the milestones: 12.2 Release, 13.1 Release, 13.1 Releasee Feb 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harden database commands against failures #7369

Harden database commands against failures #7369

onurctirtir commented Nov 29, 2023 •

edited by naisila

Loading

Harden database commands against failures #7369

Harden database commands against failures #7369

Comments

onurctirtir commented Nov 29, 2023 • edited by naisila Loading

onurctirtir commented Nov 29, 2023 •

edited by naisila

Loading