
Database refactoring

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Harsha 2005 MT (talk | contribs) at 14:56, 10 August 2024 (changed some word and added commas to easy reading and changed to be easy reading text). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

A database refactoring is a simple change to a database schema that improves its design while retaining both its behavioral and informational semantics. It does not change the way data is interpreted or used, and it neither fixes bugs nor adds new functionality. Every database refactoring leaves the system in a working state, so it does not cause maintenance lags, provided that meaningful data exists in the production environment.

A database refactoring is conceptually more difficult than a code refactoring; code refactorings only need to maintain behavioral semantics, while database refactorings also must maintain informational semantics.
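As a rough illustration of what preserving informational semantics can mean in practice, the sketch below applies a Split Column refactoring and then checks that every original value can still be rebuilt from the new columns. It assumes SQLite through Python's standard sqlite3 module; the person table, its columns, and both helper functions are hypothetical names chosen for the example, and the split rule assumes names of the form "first last".

```python
import sqlite3


def split_full_name(conn):
    """Split Column: derive first_name and last_name from full_name."""
    conn.executescript("""
        ALTER TABLE person ADD COLUMN first_name TEXT;
        ALTER TABLE person ADD COLUMN last_name TEXT;
        UPDATE person
           SET first_name = substr(full_name, 1, instr(full_name, ' ') - 1),
               last_name  = substr(full_name, instr(full_name, ' ') + 1);
    """)


def count_information_loss(conn):
    """Count rows whose original value cannot be rebuilt from the new columns."""
    return conn.execute(
        "SELECT COUNT(*) FROM person "
        "WHERE (first_name || ' ' || last_name) IS NOT full_name"
    ).fetchone()[0]


# Hypothetical sample data for the demonstration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE person (id INTEGER PRIMARY KEY, full_name TEXT NOT NULL);
    INSERT INTO person (full_name) VALUES ('Ada Lovelace'), ('Grace Hopper');
""")
split_full_name(conn)
lost = count_information_loss(conn)  # 0 means no information was lost
```

A check of this kind would typically run before the old column is retired; a non-zero count signals that the split rule does not preserve the stored information for some rows.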

A database schema is typically refactored for one of several reasons:

  • To develop the schema in an evolutionary manner in parallel with the evolutionary design of the rest of the system.
  • To fix design problems with an existing legacy database schema. Database refactorings are often motivated by the desire for database normalization of an existing production database, typically to "clean up" the design of the database.
  • To implement what would be a large (and potentially risky) change as a series of small, low-risk changes.

Categories of database refactoring

In 2006, Scott Ambler and Pramod Sadalage described the following categories of database refactoring:

  • Architecture Refactoring
A change that improves the overall manner in which external programs interact with a database.

Methods of the Architecture Refactoring category: Add CRUD Methods; Add Mirror Table; Add Read Method; Encapsulate Table with View; Introduce Calculation Method; Introduce Index; Introduce Read-Only Table; Migrate Method from Database; Migrate Method to Database; Replace Method(s) with View; Replace View with Method(s); Use Official Data Source.
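One of the methods listed above, Encapsulate Table with View, can be sketched as follows. This is a minimal example under stated assumptions: it uses SQLite through Python's sqlite3 module, and the table names (customer_tbl, party), the view name (customer), and the client_query function are all hypothetical. The point is only that an external program reading through the view is unaffected when the underlying table changes.

```python
import sqlite3


def client_query(conn):
    # Hypothetical external program: it knows only the view's name and columns.
    return conn.execute(
        "SELECT first_name, last_name FROM customer ORDER BY id"
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer_tbl (id INTEGER PRIMARY KEY, fname TEXT, lname TEXT);
    INSERT INTO customer_tbl (fname, lname) VALUES ('Ada', 'Lovelace');

    -- The view is the stable interface offered to external programs.
    CREATE VIEW customer AS
        SELECT id, fname AS first_name, lname AS last_name FROM customer_tbl;
""")
before = client_query(conn)

# Later the table is renamed; only the view definition has to follow.
conn.executescript("""
    DROP VIEW customer;
    ALTER TABLE customer_tbl RENAME TO party;
    CREATE VIEW customer AS
        SELECT id, fname AS first_name, lname AS last_name FROM party;
""")
after = client_query(conn)  # same query, same result
```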

  • Structural Refactoring
A change to the table structure of your database schema.

Methods of the Structural Refactoring category: Drop Column; Drop Table; Drop View; Introduce Calculated Column; Introduce Surrogate Key; Merge Columns; Merge Tables; Move Column; Rename Column; Rename Table; Rename View; Replace LOB with Table; Replace Column; Replace One-to-Many with Associative Tables; Replace Surrogate Key with Natural Key; Split Column; Split Table.
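A Rename Column refactoring is often carried out with a transition period in which the old and new columns coexist, so that programs not yet updated keep working. The sketch below shows one possible way to do this in SQLite through Python's sqlite3 module. The customer table, the column names, and the trigger names are hypothetical, and the final step of dropping the old column once all programs have migrated is not shown.

```python
import sqlite3


def start_rename_fname(conn):
    """Begin renaming customer.fname to customer.first_name."""
    conn.executescript("""
        ALTER TABLE customer ADD COLUMN first_name TEXT;
        UPDATE customer SET first_name = fname;

        -- Rows inserted through either column get both columns filled.
        CREATE TRIGGER customer_sync_insert AFTER INSERT ON customer
        BEGIN
            UPDATE customer
               SET first_name = COALESCE(NEW.first_name, NEW.fname),
                   fname      = COALESCE(NEW.fname, NEW.first_name)
             WHERE id = NEW.id;
        END;

        -- Updates by old programs are copied to the new column.
        CREATE TRIGGER customer_sync_old_to_new
        AFTER UPDATE OF fname ON customer
        WHEN NEW.fname IS NOT NEW.first_name
        BEGIN
            UPDATE customer SET first_name = NEW.fname WHERE id = NEW.id;
        END;

        -- Updates by new programs are copied to the old column.
        CREATE TRIGGER customer_sync_new_to_old
        AFTER UPDATE OF first_name ON customer
        WHEN NEW.first_name IS NOT NEW.fname
        BEGIN
            UPDATE customer SET fname = NEW.first_name WHERE id = NEW.id;
        END;
    """)


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (id INTEGER PRIMARY KEY, fname TEXT);
    INSERT INTO customer (fname) VALUES ('Ada');
""")
start_rename_fname(conn)

# An old program still writes fname; a new program writes first_name.
conn.execute("INSERT INTO customer (fname) VALUES ('Grace')")
conn.execute("UPDATE customer SET first_name = 'Augusta' WHERE id = 1")
rows = conn.execute(
    "SELECT fname, first_name FROM customer ORDER BY id"
).fetchall()
```

The WHEN clauses stop the two update triggers from firing each other once both columns already agree.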

  • Data Quality Refactoring
A change that improves and ensures the consistency and usage of the values stored in the database.

Methods of the Data Quality Refactoring category: Add Lookup Table; Apply Standard Codes; Apply Standard Type; Consolidate Key Strategy; Drop Column Constraint; Drop Default Value; Drop Non-Nullable; Introduce Column Constraint; Introduce Common Format; Introduce Default Value; Make Column Non-Nullable; Move Data; Replace Type Code with Property Flags.
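As an illustration of this category, the following sketch applies the Apply Standard Codes method to a column that holds several spellings of the same value. It assumes SQLite through Python's sqlite3 module; the address table, the mapping STANDARD_COUNTRY_CODES, and the function name are hypothetical and exist only for the example.

```python
import sqlite3

# Hypothetical mapping from legacy spellings to a single standard code.
STANDARD_COUNTRY_CODES = {
    "USA": "US",
    "U.S.": "US",
    "United States": "US",
    "CAN": "CA",
    "Canada": "CA",
}


def apply_standard_codes(conn):
    """Rewrite every legacy spelling in address.country to its standard code."""
    with conn:  # commit on success, roll back on error
        conn.executemany(
            "UPDATE address SET country = ? WHERE country = ?",
            [(code, legacy) for legacy, code in STANDARD_COUNTRY_CODES.items()],
        )


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE address (id INTEGER PRIMARY KEY, city TEXT, country TEXT);
    INSERT INTO address (city, country) VALUES
        ('Boston', 'USA'), ('Austin', 'U.S.'),
        ('Toronto', 'CAN'), ('Denver', 'US');
""")
apply_standard_codes(conn)
codes = sorted(row[0] for row in conn.execute("SELECT DISTINCT country FROM address"))
```

After the update only the standard codes remain in the column, while each row still denotes the same country as before.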

  • Referential Integrity Refactoring
A change that ensures a referenced row exists within another table and/or ensures that a row no longer needed is removed appropriately.

Methods of the Referential Integrity Refactoring category: Add Foreign Key Constraint; Add Trigger for Calculated Column; Drop Foreign Key Constraint; Introduce Cascading Delete; Introduce Hard Delete; Introduce Soft Delete; Introduce Trigger for History.
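The Introduce Soft Delete method from this list can be sketched as below: rows are flagged as deleted instead of being physically removed, and a view exposes only the rows that are still active. The example assumes SQLite through Python's sqlite3 module; the customer table, the is_deleted column, the active_customer view, and the function names are hypothetical.

```python
import sqlite3


def introduce_soft_delete(conn):
    """Add a deletion flag and a view that hides flagged rows."""
    conn.executescript("""
        ALTER TABLE customer ADD COLUMN is_deleted INTEGER NOT NULL DEFAULT 0;
        CREATE VIEW active_customer AS
            SELECT id, name FROM customer WHERE is_deleted = 0;
    """)


def soft_delete(conn, customer_id):
    """Mark a row as deleted without removing it from the table."""
    with conn:  # commit on success, roll back on error
        conn.execute(
            "UPDATE customer SET is_deleted = 1 WHERE id = ?", (customer_id,)
        )


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    INSERT INTO customer (name) VALUES ('Ada'), ('Grace');
""")
introduce_soft_delete(conn)
soft_delete(conn, 1)

active = conn.execute("SELECT name FROM active_customer").fetchall()
total = conn.execute("SELECT COUNT(*) FROM customer").fetchone()[0]
```

The flagged row stays in the table, so rows in other tables that reference it remain valid.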

  • Transformation
A change that alters the semantics of the database schema by adding new elements to it or by modifying existing elements. Unlike the other categories, a transformation is therefore not a refactoring in the strict sense.

Methods of the Transformation category: Insert Data; Introduce New Column; Introduce New Table; Introduce View; Update Data.

  • Method Refactoring
A change that improves the quality of a stored procedure, stored function, or trigger.

Methods of the Method Refactoring category: Parameterize Methods; Remove Parameter; Rename Method; Reorder Parameters; Replace Parameter with Explicit Methods; Consolidate Conditional Expression; Decompose Conditional; Extract Method; Introduce Variable; Remove Control Flag; Remove Middle Man; Replace Literal with Table Lookup; Replace Nested Conditional with Guard Clauses; Split Temporary Variable; Substitute Algorithm.

In 2019, Vladislav Struzik supplemented the categories of database refactoring with a new one:

  • Access Refactoring
A change that relates to data access.

Methods of the Access Refactoring category: Change Authentication Attributes; Revoke Authorization Privileges; Grant Authorization Privileges; Extract Database Schema; Merge Database Schemas.

Process of database refactoring

The process of database refactoring is the act of applying database refactorings to evolve an existing database schema; database refactoring is a core practice of evolutionary database design. Three considerations need to be taken into account:

  1. How a single refactoring is implemented
  2. How database refactorings are tracked and shared within organizations
  3. How a series of database refactorings is applied
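The second and third considerations above are commonly handled by recording each applied change in the database itself and applying pending changes in a fixed order. The sketch below is one minimal way to do that, not a description of any particular tool. It assumes SQLite through Python's sqlite3 module; the schema_change_log table, the REFACTORINGS list, and the apply_pending function are hypothetical names.

```python
import sqlite3

# Hypothetical ordered list of changes: (id, description, SQL statement).
REFACTORINGS = [
    (1, "introduce customer table",
     "CREATE TABLE customer (id INTEGER PRIMARY KEY, fname TEXT)"),
    (2, "introduce new column first_name",
     "ALTER TABLE customer ADD COLUMN first_name TEXT"),
    (3, "copy fname into first_name",
     "UPDATE customer SET first_name = fname"),
]


def apply_pending(conn, refactorings=REFACTORINGS):
    """Apply, in id order, every change not yet recorded in the log table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS schema_change_log ("
        " id INTEGER PRIMARY KEY,"
        " description TEXT NOT NULL,"
        " applied_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP)"
    )
    applied = {row[0] for row in conn.execute("SELECT id FROM schema_change_log")}
    newly_applied = []
    for change_id, description, sql in sorted(refactorings):
        if change_id in applied:
            continue  # already applied to this database
        with conn:  # commit after each change and its log entry
            conn.execute(sql)
            conn.execute(
                "INSERT INTO schema_change_log (id, description) VALUES (?, ?)",
                (change_id, description),
            )
        newly_applied.append(change_id)
    return newly_applied


conn = sqlite3.connect(":memory:")
first_run = apply_pending(conn)   # applies changes 1, 2 and 3
second_run = apply_pending(conn)  # nothing left to apply
```

Because the log lives in the database, the same list of changes can be shared within an organization and run against any copy of the schema; each copy applies only the changes it is missing.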
