Loading content...
Throughout this module, we've developed the individual skills of conceptual design—understanding high-level modeling, constructing ER diagrams, identifying entities, and discovering relationships. Now it's time to bring these skills together to create an initial schema: a complete, coherent conceptual model ready for validation and refinement.
The initial schema is not a final artifact frozen in time. It's the first complete draft—good enough to review with stakeholders, detailed enough to identify gaps, and structured enough to transition into logical design. Creating this schema requires synthesis: combining entities, relationships, attributes, and constraints into a unified whole.
This page guides you through the assembly process, provides validation techniques, discusses documentation practices, and prepares you for the next phase of database design.
By the end of this page, you will be able to assemble a complete initial schema from your conceptual design work, validate the schema for completeness and consistency, document decisions and assumptions, and prepare the model for transition to logical design. You'll understand what makes a schema 'good enough' for review and how to iterate based on feedback.
An initial schema is the first complete version of your conceptual data model. It integrates all the pieces you've developed:
Initial ≠ Final
The word "initial" is deliberate. This schema will evolve:
The goal isn't perfection—it's a solid foundation for iteration.
What Makes a Schema 'Complete Enough'
An initial schema should be:
It doesn't need to be:
| Element | Required for Initial Schema | Can Wait for Later |
|---|---|---|
| All entities identified | ✓ Yes | — |
| All relationships mapped | ✓ Yes | — |
| Primary keys defined | ✓ Yes | — |
| Cardinality specified | ✓ Yes | — |
| Participation constraints | ✓ Yes | — |
| Entity attributes listed | ✓ Yes | — |
| Attribute data types | Optional | ✓ Logical design |
| Foreign key details | Optional | ✓ Logical design |
| Index design | No | ✓ Physical design |
| Storage parameters | No | ✓ Physical design |
The initial schema should represent your best current understanding. If you're 80% confident it's correct, that's good enough to enter review. The remaining 20% will surface during validation. Waiting for 100% confidence means waiting forever—requirements change, understanding deepens, and perfection is impossible.
Schema assembly brings together all your conceptual design work. Follow this systematic process:
Step 1: Consolidate Entity Inventory
Create a master list of all entities:
Result: A definitive list like:
Step 2: Complete Attribute Specification
For each entity, list all attributes:
At this stage, you may note general types (text, number, date) but formal data types wait for logical design.
Step 3: Map All Relationships
Consolidate your relationship discoveries:
Result should look like:
Step 4: Draw the Complete Diagram
With all components identified, create the full ER diagram:
Step 5: Add Constraint Documentation
Capture constraints that don't fit diagram notation:
These go in supplementary documentation alongside the diagram.
A well-organized diagram communicates better. Group related entities spatially. Place central entities in the middle. Keep relationship lines as short and uncrossed as possible. Use consistent notation throughout. A messy diagram obscures understanding even if technically correct.
Before presenting the schema for review, perform systematic validation to catch issues you can identify yourself.
Validation 1: Requirements Traceability
Every requirement should map to the schema:
Example traceability:
| Requirement | Schema Element | Status |
|---|---|---|
| Track customer names | Customer.Name | ✓ Covered |
| Store order history | Order entity + Customer relationship | ✓ Covered |
| Calculate monthly revenue | Derived from Order.Total | ❓ Verify derivable |
| Track product reviews | ❌ Missing | Gap identified |
Validation 2: Query Walkthrough
For key queries stakeholders need, verify the model can answer them:
Example:
Validation 3: Instance Testing
Create sample instances to verify the model handles real data:
Example:
Validation 4: Edge Case Analysis
Consider edge cases and boundary conditions:
Edge cases often reveal participation constraint issues.
Validation 5: Semantic Accuracy
Verify the model captures meaning, not just structure:
Example: If an order can only have one shipping address, but the model shows M:N between Order and Address, invalid states are possible.
Self-validation before review builds credibility. If stakeholders find basic issues you should have caught—missing obvious entities, broken query paths, wrong cardinalities—they lose confidence in the model. Thorough self-validation elevates review discussions to genuine domain questions.
The initial schema must be validated by stakeholders who understand the domain. They'll catch issues you can't—because you don't know the business as deeply as they do.
Preparing for Review
Create presentation materials: Large, clear diagrams. Avoid dense, technical drawings. Consider separate views for different audiences.
Write a glossary: Define every entity in business terms. "Customer: An individual or organization who has registered to make purchases."
Prepare walkthrough scenarios: Real examples stakeholders can follow. "Let's trace what happens when customer Jane Doe places an order..."
List assumptions and decisions: Document where you made judgment calls. "We assumed employees can only belong to one department. Is this correct?"
Identify open questions: What couldn't you determine independently? "Does a product need to be categorized before it can be sold?"
Running the Review
Effective review sessions:
Start with context: Remind stakeholders of the purpose. "This model represents the data for the order management system."
Walk through, don't lecture: Step through scenarios, asking for confirmation. "So a customer places multiple orders over time—is that right?"
Encourage questions: Silence isn't agreement. Probe: "Does anything look unexpected or missing?"
Focus on semantics, not notation: Non-technical stakeholders may struggle with ER notation. Translate: "This line means one customer can have many orders."
Handling Feedback
Review feedback falls into categories:
Corrections: The model is wrong.
Additions: Something is missing.
Clarifications: The model is unclear.
Scope Questions: Uncertainty about whether something belongs.
Future Requirements: Things that will be needed but aren't in current scope.
Iterating After Review
Review typically requires model updates:
Most conceptual designs go through 2-3 review iterations before stabilizing.
Don't show the entire complex model to everyone. Operations staff care about transactions; they don't need to see HR entities. Create focused views for different audiences. The full model exists, but reviews use subsets relevant to each stakeholder group.
A diagram alone doesn't constitute complete documentation. The initial schema should be documented thoroughly for future reference, team communication, and design continuity.
Essential Documentation Components
1. Entity Catalog
For each entity, document:
Example:
Entity: Customer
Description: An individual or organization registered to make purchases
Primary Key: CustomerID
Attributes:
- CustomerID: Unique identifier for the customer
- Name: Full name of individual or organization name
- Email: Primary contact email (unique)
- RegistrationDate: Date customer account created
- Status: Active, Inactive, or Suspended
Constraints:
- Email must be unique across customers
- Status must be one of the defined values
Notes: We treat individuals and organizations as the same entity type.
Consider splitting if different attributes are needed.
2. Relationship Catalog
For each relationship, document:
| Component | Purpose | Audience | Format |
|---|---|---|---|
| ER Diagram | Visual overview of model | All stakeholders | Drawing/diagram tool export |
| Entity Catalog | Detailed entity definitions | Designers, developers | Structured document/wiki |
| Relationship Catalog | Relationship specifications | Designers, developers | Structured document/wiki |
| Glossary | Term definitions | All stakeholders | Alphabetical list |
| Assumptions Log | Documented decisions | Future maintainers | Dated log entries |
| Requirements Traceability | Requirements to model mapping | Project managers, QA | Matrix/table |
| Open Issues | Unresolved questions | Project team | Issue tracking system |
3. Glossary
Define all terms used in the model:
A glossary ensures everyone interprets terms consistently. When a new team member asks "What's an SKU?", the glossary answers.
4. Assumptions and Decisions Log
Record the reasoning behind decisions:
Date: 2024-01-15
Decision: Customer and Organization are the same entity
Rationale: Current requirements don't distinguish them;
attributes are identical; simplifies model.
Revisit if: Business needs different handling for B2B vs B2C.
Consider generalization/specialization.
This log prevents revisiting settled issues and explains why the model looks the way it does.
5. Known Limitations
Document what the model doesn't do:
Clear limitations prevent incorrect expectations.
Documentation Practices
Good documentation means the conceptual design can be understood by someone who wasn't present for its creation. This is crucial for team scaling, personnel changes, and long-term maintenance. The diagram shows what; the documentation explains why.
How do you know if your initial schema is good? Beyond basic correctness, quality schemas exhibit several characteristics that distinguish professional work from amateur attempts.
Criterion 1: Semantic Richness
The model captures meaning, not just structure:
A semantically rich model can be read as a description of the domain. Someone unfamiliar with the business should understand it from the model.
Criterion 2: Minimal Redundancy
Each fact is represented once:
Redundancy in conceptual models leads to redundancy in databases, causing update anomalies.
Criterion 3: Appropriate Abstraction
The model operates at the right level:
Criterion 4: Consistency
The model is internally coherent:
Criterion 5: Evolvability
The model can grow:
Criterion 6: Visual Clarity
The diagram communicates effectively:
Self-Assessment
Before presenting for review, rate your schema against these criteria:
Time invested in conceptual design quality pays off many times over. A high-quality initial schema accelerates logical design, reduces implementation rework, and prevents production data problems. Never rush conceptual design to start coding sooner—the technical debt will exceed any time 'saved.'
The initial schema will be transformed into a logical schema—typically a relational schema ready for implementation. Preparing for this transition ensures smooth continuity.
What Logical Design Will Do
Logical design transforms the conceptual model:
Information to Provide for Logical Design
To facilitate transition, ensure your conceptual design provides:
Clear Primary Keys: Every entity needs a clear identifier. Natural keys are preferred at conceptual level; surrogates may be added during logical design.
Cardinality and Participation: These determine foreign key placement and NULL constraints.
Weak Entity Indicators: Weak entities become tables with composite keys.
Relationship Attributes: These move to junction tables for M:N relationships.
Specialization Constraints: Disjoint/overlapping and total/partial guide table strategy.
Domain Constraints: Valid values for attributes guide CHECK constraints and lookup tables.
| Conceptual Element | Logical Representation | Considerations |
|---|---|---|
| Strong Entity | Table with primary key | Straightforward mapping |
| Weak Entity | Table with composite PK including owner's key | Foreign key to owner with CASCADE DELETE |
| 1:1 Relationship | Foreign key in one table (usually total participation side) | Choose which side holds FK |
| 1:N Relationship | Foreign key in 'N' side table | Standard pattern |
| M:N Relationship | Junction table with two foreign keys | Junction table may have additional columns for relationship attributes |
| Multivalued Attribute | Separate table with FK to owner | One row per value |
| Composite Attribute | Multiple columns or flattened | Design choice based on usage |
| Specialization | Single table, separate tables, or combined | Depends on overlap and coverage |
Anticipating Normalization
Logical design includes normalization—ensuring the design avoids update anomalies. While normalization is a logical design activity, you can anticipate issues:
A well-constructed conceptual model rarely needs significant restructuring during normalization—these issues are usually avoided by good entity identification.
Handoff Checklist
Before transitioning to logical design, verify:
Transition Meeting
If different people do conceptual and logical design, hold a handoff meeting:
Some worry that conceptual design is 'extra work' that logical design duplicates. It isn't. Conceptual design enables faster, more accurate logical design. It provides reviewed, validated requirements mapping. It documents decisions. It creates shared understanding. Far from duplicating effort, conceptual design reduces total project effort.
We've completed the conceptual design journey—from understanding high-level modeling principles through creating a validated initial schema. Let's consolidate the full module learning:
The Conceptual Design Discipline
Conceptual design is both an art and a discipline. The art lies in understanding domains, recognizing patterns, and making judgment calls when information is ambiguous. The discipline lies in systematic techniques, thorough validation, and rigorous documentation.
The best database designers combine both—creative insight guided by methodical process. They produce models that are accurate, complete, and elegant; models that stakeholders recognize as capturing their domain; models that serve as solid foundations for years of system evolution.
What's Next in the Database Design Journey
With conceptual design complete, the next phase is logical design—transforming the conceptual model into a schema for a specific data model paradigm (typically relational). Logical design includes:
The conceptual model you've created makes logical design faster and more accurate. Instead of guessing at table structures, the logical designer transforms a validated, documented conceptual design systematically.
Congratulations on Completing This Module
You now possess comprehensive skills in conceptual database design. You can analyze domains, construct ER diagrams, identify entities and relationships, create initial schemas, and validate your work to professional standards. These skills form the foundation of all database design work.
Congratulations! You've completed Module 3: Conceptual Design. You've mastered high-level modeling principles, ER diagram construction, entity and relationship identification, and created validated initial schemas. You're now prepared to move forward with logical and physical database design, building upon this solid conceptual foundation.