The Network Data Model

Level: Intermediate

Duration: 60 mins

Topic: Network Model

Graph Structure — The Network Data Model Foundation

Beyond the Tree: When Hierarchies Fall Short

By the late 1960s, the hierarchical model had proven its worth in production database systems, most notably through IBM's Information Management System (IMS). Organizations could faithfully represent tree-structured data—organizational charts, product catalogs, and bill-of-materials hierarchies. But as applications grew more sophisticated, a fundamental limitation became painfully apparent: the real world doesn't always organize itself into neat trees.

Consider a student enrolled in multiple courses. In a pure hierarchical model, where does this student belong? Under the Computer Science course? Under the Mathematics course? If we duplicate the student record under each course, we face data redundancy and consistency nightmares. If we choose only one parent, we lose critical relationship information. The rigid parent-child constraint of the hierarchical model—where every child has exactly one parent—simply couldn't accommodate the many-to-many relationships pervasive in real-world domains.

The network model emerged as a direct response to these limitations, offering a fundamentally different organizing principle based on graph structures rather than tree structures.

Learning Objectives

By the end of this page, you will understand: (1) How graph structures overcome the limitations of hierarchical trees, (2) The fundamental components of the network model—nodes, edges, and their semantics, (3) How multiple parentage enables many-to-many relationship modeling, (4) The formal mathematical definition of network structures, and (5) Why this architectural shift represented a significant leap in data modeling capability.

The Mathematical Foundation: From Trees to Graphs

To understand why the network model represented a fundamental advancement, we must first establish the mathematical distinction between trees and graphs.

Tree Structure (Hierarchical Model):

A tree is a connected, acyclic graph in which:

There exists exactly one root node with no parent
Every non-root node has exactly one parent
There are no cycles—you cannot traverse from a node back to itself without backtracking
There is exactly one path between any two nodes

Formally, for a tree with n nodes, there are exactly n-1 edges. The constraint of single parentage means that if we visualize relationships as arrows pointing from parent to child, each child has exactly one incoming arrow.

Graph Structure (Network Model):

A graph relaxes the tree constraints:

Multiple parents allowed: Any node can have zero, one, or many incoming edges
Cycles permitted: A path can lead back to its origin
Multiple paths possible: Different routes may connect the same pair of nodes
Richer connectivity: The relationship structure mirrors real-world complexity

Formally, a directed graph G = (V, E) consists of a set of vertices V and a set of directed edges E ⊆ V × V. For n vertices, the number of edges can range from 0 to n² (including self-loops) or n(n-1) (excluding self-loops), vastly exceeding the n-1 edges of a tree.
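
The structural difference can be checked mechanically. The following is a minimal Python sketch, not taken from any database product: the function names `in_degrees` and `is_rooted_tree` and the sample node names are illustrative assumptions. It tests the tree constraints (one root, single parentage, n-1 edges, everything reachable from the root) against a list of (parent, child) edges.

```python
from collections import defaultdict

def in_degrees(nodes, edges):
    """Count incoming edges (parents) for every node."""
    counts = {node: 0 for node in nodes}
    for _parent, child in edges:
        counts[child] += 1
    return counts

def is_rooted_tree(nodes, edges):
    """True if the (parent, child) edges form one rooted tree over nodes."""
    counts = in_degrees(nodes, edges)
    roots = [n for n, c in counts.items() if c == 0]
    # Exactly one root and exactly n-1 edges.
    if len(roots) != 1 or len(edges) != len(nodes) - 1:
        return False
    # Single parentage: no node may have more than one incoming edge.
    if any(c > 1 for c in counts.values()):
        return False
    # Every node must be reachable from the root (rules out detached cycles).
    children = defaultdict(list)
    for parent, child in edges:
        children[parent].append(child)
    seen, stack = set(), [roots[0]]
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(children[node])
    return seen == set(nodes)

# Hypothetical sample data: a student placed under one department, then two.
nodes = ["UNIVERSITY", "CS", "MATH", "alice"]
tree_edges = [("UNIVERSITY", "CS"), ("UNIVERSITY", "MATH"), ("CS", "alice")]
network_edges = tree_edges + [("MATH", "alice")]  # alice gains a second parent

print(is_rooted_tree(nodes, tree_edges))      # single parentage holds
print(is_rooted_tree(nodes, network_edges))   # multiple parentage breaks the tree
```

Adding the single extra edge gives `alice` an in-degree of 2, which is exactly the relaxation the network model permits.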

Tree vs. Graph: Structural Properties
Property	Tree (Hierarchical)	Graph (Network)
Parent count per node	Exactly 1 (except root)	0, 1, or many
Cycles	Not allowed	Permitted
Paths between nodes	Exactly 1	Potentially many
Edge count for n nodes	Exactly n-1	0 to n² (directed)
Root requirement	Exactly one required	Not required
Relationship expressiveness	Limited (1:N only)	Full (M:N supported)
Navigation complexity	Simple—follow parent/child	Complex—multiple paths possible

The Modeling Implications:

This seemingly simple mathematical relaxation—allowing multiple parents—has profound implications for data modeling:

Many-to-Many Relationships: A student can be linked to multiple courses while each course remains linked to multiple students. Neither entity has to be nested under the other; both can act as owners of a shared link record.
Shared Subordinates: A component that appears in multiple products need not be duplicated. A single part record can be referenced by multiple parent assemblies.
Network Semantics: Real-world networks—social connections, transportation routes, supply chains—can be modeled naturally without artificial decomposition.
Reference vs. Containment: Rather than physically nesting data (containment), the network model allows referencing the same data from multiple contexts.

The Power of Multiple Parentage

The single most important difference between hierarchical and network models is multiple parentage. This one change transforms the data model from representing trees to representing arbitrary directed graphs, enabling a leap in expressiveness that unlocks domains previously inaccessible to database management.

Network Model Components: Records and Sets

The network model introduces specific terminology for its graph-based structure. Understanding this vocabulary is essential for comprehending network database systems and the CODASYL standard that formalized them.

Record Types (Nodes):

A record type in the network model is analogous to an entity type in the ER model or a table in the relational model. It defines a template for storing related data items.

Key characteristics:

Each record type has a name (e.g., STUDENT, COURSE, ENROLLMENT)
Each contains data items (fields)—similar to columns in relational databases
Record occurrences (instances) are the actual data entries—similar to rows
Each occurrence has a database key—a unique identifier assigned by the DBMS, similar to a physical address or pointer

network_record_definition.txt
RECORD TYPE: STUDENT
DATA ITEMS:
    StudentID       : INTEGER
    Name            : CHARACTER(50)
    DateOfBirth     : DATE
    Major           : CHARACTER(30)
    GPA             : DECIMAL(3,2)
 
RECORD TYPE: COURSE
DATA ITEMS:
    CourseCode      : CHARACTER(10)
    Title           : CHARACTER(100)
    Credits         : INTEGER
    Department      : CHARACTER(30)
    MaxEnrollment   : INTEGER
 
RECORD TYPE: INSTRUCTOR
DATA ITEMS:
    InstructorID    : INTEGER
    Name            : CHARACTER(50)
    Department      : CHARACTER(30)
    OfficeLocation  : CHARACTER(20)

Set Types (Edges):

A set type defines a named one-to-many (1:N) relationship between two record types. Despite the network model supporting many-to-many relationships conceptually, each individual set is a 1:N link. Many-to-many relationships are constructed by chaining multiple set types through an intermediate record type.

Set type components:

Owner record type: The "one" side of the 1:N relationship (the parent)
Member record type: The "many" side (the children)
Set name: An identifier for this relationship
Set occurrence: An actual instance—one owner record linked to zero or more member records

The term "set" is somewhat misleading mathematically—it's really an ordered collection (potentially with insertion order semantics) rather than an unordered mathematical set.

network_set_definition.txt

SET TYPE: DEPT_INSTRUCTOR
    OWNER:  DEPARTMENT
    MEMBER: INSTRUCTOR
    ORDER:  SORTED BY Instructor.Name
    -- Links each department to its instructors
 
SET TYPE: DEPT_COURSE
    OWNER:  DEPARTMENT
    MEMBER: COURSE
    ORDER:  FIRST (insert at beginning)
    -- Links each department to its courses
 
SET TYPE: COURSE_ENROLLMENT
    OWNER:  COURSE
    MEMBER: ENROLLMENT
    ORDER:  LAST (insert at end)
    -- Links each course to enrollment records
 
SET TYPE: STUDENT_ENROLLMENT
    OWNER:  STUDENT
    MEMBER: ENROLLMENT
    ORDER:  SORTED BY Enrollment.EnrollmentDate
    -- Links each student to enrollment records
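
The three ORDER options used above can be illustrated with a small Python sketch. It models one set occurrence as a plain list, which is a simplification of the pointer-based storage a real system would use; the function name `insert_member` and the sample values are assumptions made for illustration.

```python
import bisect

def insert_member(members, record, order, key=None):
    """Insert record into the member list of one set occurrence.

    order is "FIRST", "LAST", or "SORTED"; key extracts the sort key.
    """
    key = key or (lambda r: r)
    if order == "FIRST":
        members.insert(0, record)          # newest member goes to the front
    elif order == "LAST":
        members.append(record)             # newest member goes to the end
    elif order == "SORTED":
        keys = [key(m) for m in members]   # existing members are already sorted
        members.insert(bisect.bisect_right(keys, key(record)), record)
    else:
        raise ValueError(f"unknown ORDER option: {order}")
    return members

# DEPT_COURSE uses ORDER FIRST: the most recently added course comes first.
courses = []
insert_member(courses, "CS101", "FIRST")
insert_member(courses, "CS201", "FIRST")

# DEPT_INSTRUCTOR uses ORDER SORTED BY name.
instructors = []
for name in ["Martinez", "Chen", "Nguyen"]:
    insert_member(instructors, name, "SORTED", key=str.lower)

print(courses)
print(instructors)
```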

Sets Enable the Network Structure

The key insight is that a single member record can belong to multiple sets of different types. The ENROLLMENT record above is a member of both COURSE_ENROLLMENT and STUDENT_ENROLLMENT sets. This dual membership is what creates the many-to-many relationship between STUDENT and COURSE—the graph structure emerges from overlapping set memberships.
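
As a rough illustration of how overlapping set memberships yield a many-to-many relationship, here is a minimal Python sketch. It uses object references and lists in place of database keys and pointer chains, and the class and function names (`Student`, `Course`, `Enrollment`, `enroll`, `courses_of`, `students_in`) are hypothetical.

```python
from dataclasses import dataclass, field

@dataclass(eq=False)  # identity semantics: each object is one record occurrence
class Student:
    name: str
    enrollments: list = field(default_factory=list)  # STUDENT_ENROLLMENT members

@dataclass(eq=False)
class Course:
    code: str
    enrollments: list = field(default_factory=list)  # COURSE_ENROLLMENT members

@dataclass(eq=False)
class Enrollment:
    student: Student   # owner in STUDENT_ENROLLMENT
    course: Course     # owner in COURSE_ENROLLMENT
    grade: str = ""

def enroll(student, course, grade=""):
    """Create one ENROLLMENT record and connect it to both owners."""
    link = Enrollment(student, course, grade)
    student.enrollments.append(link)
    course.enrollments.append(link)
    return link

def courses_of(student):
    """Student -> its enrollments -> each enrollment's course owner."""
    return [e.course.code for e in student.enrollments]

def students_in(course):
    """Course -> its enrollments -> each enrollment's student owner."""
    return [e.student.name for e in course.enrollments]

alice, bob = Student("Alice"), Student("Bob")
cs101, math201 = Course("CS101"), Course("MATH201")
enroll(alice, cs101)
enroll(alice, math201)
enroll(bob, cs101)

print(courses_of(alice))
print(students_in(cs101))
```

Each student and each course exists once; only the small ENROLLMENT records multiply as the relationship grows.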

Visualizing the Network Graph Structure

Abstract graph theory becomes tangible when we visualize network structures. Let's examine how the network model represents a university domain, contrasting it with how the same domain would be constrained in a hierarchical model.

The University Domain:

Consider these real-world facts:

A department has many instructors (1:N)
A department offers many courses (1:N)
An instructor can teach many courses (1:N, but potentially M:N)
A course can have many enrolled students (1:N from course perspective)
A student can enroll in many courses (1:N from student perspective)
Together, students and courses have an M:N enrollment relationship

[Diagram: network structure of the university domain, showing the record types and the set types that connect them]

Key Observations in the Diagram:

ENROLLMENT as an Intersection Record: The ENROLLMENT record type has two owners—it belongs to both the COURSE_ENROLLMENT set and the STUDENT_ENROLLMENT set. This is the network model's technique for representing M:N relationships.
Multiple Set Memberships: Notice how COURSE is both a member (of DEPT_COURSE and TEACHES) and an owner (of COURSE_ENROLLMENT). Records can simultaneously play both roles.
No Single Root: Unlike hierarchical structures, there's no designated root. The structure forms a directed graph that can be navigated from any entry point.
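Cycle Potential: If instructors could also be students (e.g., graduate teaching assistants), navigation could return to its starting point: STUDENT → ENROLLMENT → COURSE → TEACHES → INSTRUCTOR → (back to STUDENT if an instructor-student link existed). Some steps on this path follow a set from member to owner, so it is a cycle in the navigational sense rather than a chain of owner-to-member edges.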

Hierarchical Limitation

•Student must belong to ONE parent
•Either: STUDENT under COURSE (lose student identity)
•Or: COURSE under STUDENT (lose course identity)
•Or: Duplicate student records under each course
•Result: Either data loss or redundancy

Network Solution

•ENROLLMENT record has TWO owners
•STUDENT owns its ENROLLMENT instances
•COURSE owns the same ENROLLMENT instances
•Each student and course stored exactly once
•Result: Full relationship fidelity, no redundancy

Pointer Chains: How Links Are Implemented

Graph structures in network databases aren't merely conceptual—they're implemented through physical pointer chains stored directly in the database. Understanding this implementation illuminates both the power and the challenges of the network model.

Set Implementation via Linked Lists:

Each set occurrence (an owner with its members) is typically implemented as a circular linked list:

Owner Pointer Chain: The owner record contains a pointer to its first member
Member Chain: Each member record points to the next member in the set
Circular Return: The last member points back to the owner, completing the circle
Back Pointers (Optional): Members may also contain a pointer back to the owner for efficient owner retrieval

pointer_chain_structure.txt
SET OCCURRENCE EXAMPLE: DEPT_INSTRUCTOR set for "Computer Science" department

┌──────────────────────────────────────────────────────┐
│ OWNER RECORD: DEPARTMENT "Computer Science"          │
│   DeptID:   "CS"                                     │
│   Name:     "Computer Science"                       │
│   Building: "Engineering Hall"                       │
│   FIRST_MEMBER_PTR: → MEMBER 1                       │
└──────────────────────────────────────────────────────┘
        │
        ▼
┌──────────────────────────────────────────────────────┐
│ MEMBER 1: INSTRUCTOR "Dr. Alice Chen"                │
│   InstID: 101                                        │
│   Name:   "Dr. Alice Chen"                           │
│   Office: "EH-301"                                   │
│   OWNER_PTR:       → DEPARTMENT "CS"                 │
│   NEXT_MEMBER_PTR: → MEMBER 2                        │
└──────────────────────────────────────────────────────┘
        │
        ▼
┌──────────────────────────────────────────────────────┐
│ MEMBER 2: INSTRUCTOR "Dr. Bob Martinez"              │
│   InstID: 102                                        │
│   Name:   "Dr. Bob Martinez"                         │
│   Office: "EH-305"                                   │
│   OWNER_PTR:       → DEPARTMENT "CS"                 │
│   NEXT_MEMBER_PTR: → next member, or owner if last   │
└──────────────────────────────────────────────────────┘
        │
        │  ... more members ...
        │
        └──→ (circular: last member's NEXT_MEMBER_PTR → back to owner)
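
The circular chain pictured above can be sketched in a few lines of Python. This is an illustrative model only: object references stand in for database keys, the owner's `next` field plays the role of FIRST_MEMBER_PTR, and the names `Record`, `connect_last`, and `members` are assumptions rather than any system's API.

```python
class Record:
    """A record occurrence carrying the pointer fields for one set type."""
    def __init__(self, label):
        self.label = label
        self.next = self    # NEXT pointer; an empty set's owner points to itself
        self.owner = self   # OWNER pointer (the optional back pointer)

def connect_last(owner, member):
    """Append member at the end of the owner's circular chain."""
    tail = owner
    while tail.next is not owner:   # walk to the current last record
        tail = tail.next
    tail.next = member
    member.next = owner             # the last member closes the circle
    member.owner = owner

def members(owner):
    """Yield members by following NEXT pointers until the owner reappears."""
    current = owner.next
    while current is not owner:
        yield current
        current = current.next

cs = Record("CS")
for name in ["Dr. Alice Chen", "Dr. Bob Martinez"]:
    connect_last(cs, Record(name))

print([m.label for m in members(cs)])
```

Traversal needs no search or index lookup: it stops when a NEXT pointer leads back to the owner. Without the optional owner pointer, finding a member's owner would mean walking the rest of the chain.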

Dual Set Membership Illustrated:

When a record belongs to multiple sets, it participates in multiple pointer chains simultaneously. The ENROLLMENT record demonstrates this complexity—it contains pointers for both the COURSE_ENROLLMENT set and the STUDENT_ENROLLMENT set:

dual_set_membership.txt
ENROLLMENT RECORD with DUAL SET MEMBERSHIP:
 
┌──────────────────────────────────────────────────────────────────────┐
│  ENROLLMENT RECORD                                                    │
│  ┌────────────────────────────────────────────────────────────────┐  │
│  │ EnrollmentID: 50001                                            │  │
│  │ Grade: "A"                                                     │  │
│  │ Semester: "Fall 2024"                                          │  │
│  │ EnrollmentDate: 2024-08-15                                     │  │
│  │                                                                 │  │
│  │ // COURSE_ENROLLMENT set pointers                              │  │
│  │ COURSE_OWNER_PTR: → COURSE "CS101"                             │  │
│  │ COURSE_NEXT_PTR: → next ENROLLMENT in CS101                    │  │
│  │ COURSE_PREV_PTR: → prev ENROLLMENT in CS101  (if doubly-linked)│  │
│  │                                                                 │  │
│  │ // STUDENT_ENROLLMENT set pointers                             │  │
│  │ STUDENT_OWNER_PTR: → STUDENT "John Smith"                      │  │
│  │ STUDENT_NEXT_PTR: → next ENROLLMENT for John                   │  │
│  │ STUDENT_PREV_PTR: → prev ENROLLMENT for John (if doubly-linked)│  │
│  └────────────────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────────────────┘
 
This single ENROLLMENT record is simultaneously:
  - A member of the CS101 course's enrollment set (linked with other CS101 enrollments)
  - A member of John Smith's enrollment set (linked with John's other course enrollments)
 
Traversal from COURSE "CS101":
  CS101 → FIRST_MEMBER → ENROLL_50001 → NEXT → ENROLL_50002 → ... → (back to CS101)
 
Traversal from STUDENT "John Smith":
  John → FIRST_MEMBER → ENROLL_50001 → NEXT → ENROLL_50007 → ... → (back to John)
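
To make the two traversals concrete, the sketch below keeps one pair of pointer slots per set type in each record, keyed by set name. It is a simplified Python model under stated assumptions: the names `LinkedRecord`, `connect`, `members`, and `courses_for` are hypothetical, and new members are linked at the front of the chain for brevity.

```python
class LinkedRecord:
    def __init__(self, label):
        self.label = label
        self.next = {}    # set name -> next record in that set's chain
        self.owner = {}   # set name -> owner record in that set

def connect(set_name, owner, member):
    """Link member at the front of owner's chain for the named set."""
    first = owner.next.get(set_name, owner)   # an empty chain loops to the owner
    member.next[set_name] = first
    member.owner[set_name] = owner
    owner.next[set_name] = member

def members(set_name, owner):
    """Follow the named set's NEXT pointers until the owner reappears."""
    current = owner.next.get(set_name, owner)
    while current is not owner:
        yield current
        current = current.next[set_name]

def courses_for(student):
    """Walk STUDENT_ENROLLMENT, then hop to each enrollment's COURSE owner."""
    return [e.owner["COURSE_ENROLLMENT"].label
            for e in members("STUDENT_ENROLLMENT", student)]

john = LinkedRecord("John Smith")
cs101, math201 = LinkedRecord("CS101"), LinkedRecord("MATH201")
for course in (cs101, math201):
    enrollment = LinkedRecord(f"ENROLL {john.label}/{course.label}")
    connect("STUDENT_ENROLLMENT", john, enrollment)
    connect("COURSE_ENROLLMENT", course, enrollment)

print(courses_for(john))
```

Each ENROLLMENT record sits on two independent chains at once, which is why it needs a separate group of pointers for every set it belongs to.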

The Pointer Overhead

Each set membership requires additional pointer storage in the member record. A record belonging to N sets carries between N and 3N extra pointer fields: a next pointer per set at minimum, plus optional prior and owner pointers. This overhead is significant, and it is one of the reasons commonly cited for the eventual dominance of relational databases, which relate records by value rather than by stored pointers.
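
A back-of-the-envelope calculation shows how the overhead scales. The Python sketch below assumes a NEXT pointer is always present and that PRIOR and OWNER pointers are optional; the 4-byte pointer size and the function name `pointer_overhead` are illustrative assumptions, not figures from any particular system.

```python
def pointer_overhead(set_memberships, pointer_bytes=4, prior=False, owner=False):
    """Bytes of pointer storage carried by one member record."""
    pointers_per_set = 1 + int(prior) + int(owner)   # NEXT is always present
    return set_memberships * pointers_per_set * pointer_bytes

# An ENROLLMENT record is a member of two sets.
print(pointer_overhead(2))                           # NEXT pointers only
print(pointer_overhead(2, owner=True))               # NEXT + OWNER
print(pointer_overhead(2, prior=True, owner=True))   # NEXT + PRIOR + OWNER
```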

Formal Schema Definition in Network Databases

The network database schema formally defines the complete graph structure—all record types, their fields, and all set relationships between them. This schema is typically defined using a Data Definition Language (DDL) that became standardized through CODASYL.

Schema Components:

Schema Name: Identifies the entire database schema
Record Definitions: Each record type with its data items
Set Definitions: Each relationship with owner, member, and ordering specifications
Integrity Constraints: Rules governing set membership and data validity

university_network_schema.ddl
SCHEMA NAME IS UNIVERSITY_DB
 
RECORD NAME IS DEPARTMENT
    LOCATION MODE IS CALC USING DeptID
    DUPLICATES ARE NOT ALLOWED
    02 DeptID         TYPE IS CHARACTER 10
    02 DeptName       TYPE IS CHARACTER 50
    02 Building       TYPE IS CHARACTER 30
    02 Budget         TYPE IS DECIMAL 12
 
RECORD NAME IS INSTRUCTOR
    LOCATION MODE IS VIA DEPT_INSTRUCTOR SET
    02 InstructorID   TYPE IS DECIMAL 8
    02 Name           TYPE IS CHARACTER 50
    02 HireDate       TYPE IS DATE
    02 Salary         TYPE IS DECIMAL 10
    02 Office         TYPE IS CHARACTER 20
 
RECORD NAME IS COURSE
    LOCATION MODE IS CALC USING CourseCode
    02 CourseCode     TYPE IS CHARACTER 10
    02 Title          TYPE IS CHARACTER 100
    02 Credits        TYPE IS DECIMAL 2
    02 Description    TYPE IS CHARACTER 500
 
RECORD NAME IS STUDENT
    LOCATION MODE IS CALC USING StudentID
    02 StudentID      TYPE IS DECIMAL 10
    02 Name           TYPE IS CHARACTER 50
    02 DateOfBirth    TYPE IS DATE
    02 Major          TYPE IS CHARACTER 30
    02 GPA            TYPE IS DECIMAL 3
 
RECORD NAME IS ENROLLMENT
    LOCATION MODE IS VIA COURSE_ENROLLMENT SET
    02 EnrollmentDate TYPE IS DATE
    02 Grade          TYPE IS CHARACTER 2
    02 Semester       TYPE IS CHARACTER 20
 
SET NAME IS DEPT_INSTRUCTOR
    OWNER IS DEPARTMENT
    ORDER IS SORTED BY DEFINED KEYS
    MEMBER IS INSTRUCTOR
        KEY IS Name ASCENDING
        INSERTION IS AUTOMATIC
        RETENTION IS MANDATORY
        SET SELECTION IS THRU CURRENT OF DEPT_INSTRUCTOR
 
SET NAME IS DEPT_COURSE
    OWNER IS DEPARTMENT
    ORDER IS SORTED BY DEFINED KEYS
    MEMBER IS COURSE
        KEY IS CourseCode ASCENDING
        INSERTION IS MANUAL
        RETENTION IS OPTIONAL
        SET SELECTION IS BY APPLICATION
 
SET NAME IS COURSE_ENROLLMENT
    OWNER IS COURSE
    ORDER IS LAST
    MEMBER IS ENROLLMENT
        INSERTION IS AUTOMATIC
        RETENTION IS FIXED
        SET SELECTION IS STRUCTURAL
 
SET NAME IS STUDENT_ENROLLMENT
    OWNER IS STUDENT
    ORDER IS SORTED BY DEFINED KEYS
    MEMBER IS ENROLLMENT
        KEY IS EnrollmentDate ASCENDING
        INSERTION IS AUTOMATIC
        RETENTION IS FIXED
        SET SELECTION IS STRUCTURAL

Key Schema Concepts

•LOCATION MODE: How records are physically stored. CALC uses hashing on a key field; VIA stores members near their owner for efficient traversal.
•INSERTION IS AUTOMATIC/MANUAL: Whether a new member record is connected to a set occurrence automatically when it is stored (AUTOMATIC) or only when the application connects it explicitly (MANUAL).
•RETENTION IS OPTIONAL/MANDATORY/FIXED: How tightly a member is bound to the set. OPTIONAL members can be removed from the set and left unowned; MANDATORY members must always belong to some occurrence of the set but can be moved to a different owner; FIXED members stay with their original owner until the record is deleted.
•ORDER IS SORTED/FIRST/LAST: How members are ordered within a set occurrence: sorted by key, inserted at the front, or appended at the end.
•SET SELECTION: How the DBMS determines which set occurrence to use when inserting: current position, application-specified, or structurally determined.
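
The retention classes are easiest to compare as rules about which operations are allowed on an existing member. The following Python sketch encodes those rules under the usual CODASYL reading of the three options; the names `Retention`, `can_disconnect`, and `can_reconnect` are hypothetical and do not correspond to DML verbs of any specific product.

```python
from enum import Enum

class Retention(Enum):
    OPTIONAL = "optional"
    MANDATORY = "mandatory"
    FIXED = "fixed"

def can_disconnect(retention):
    """May the member be removed from the set and left with no owner?"""
    return retention is Retention.OPTIONAL

def can_reconnect(retention):
    """May the member be moved to a different owner in the same set type?"""
    return retention in (Retention.OPTIONAL, Retention.MANDATORY)

# Retention classes taken from the schema above.
schema_retention = {
    "DEPT_INSTRUCTOR": Retention.MANDATORY,
    "DEPT_COURSE": Retention.OPTIONAL,
    "COURSE_ENROLLMENT": Retention.FIXED,
    "STUDENT_ENROLLMENT": Retention.FIXED,
}

for set_name, retention in schema_retention.items():
    print(set_name, can_disconnect(retention), can_reconnect(retention))
```

Read against the schema, an instructor can transfer between departments but never be without one, a course can exist outside any department, and an enrollment can never be reassigned to another student or course.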

Graph Theory Concepts in the Network Model

Understanding the network model deeply requires connecting it to fundamental graph theory concepts. These formal foundations explain both the power and the computational characteristics of network databases.

Directed Graphs (Digraphs):

Network database structures form directed graphs—graphs where edges have direction (from owner to member). Key properties:

Vertices (Nodes): Record occurrences in the database
Directed Edges (Arcs): Set membership links from owner to member
In-Degree: Number of set occurrences a record belongs to as a member (multiple parentage)
Out-Degree: Number of member records a record is linked to across the set occurrences it owns (children count)

Graph Theory Terminology in Network Databases
Graph Theory Term	Network Database Equivalent	Example
Vertex/Node	Record occurrence	A specific STUDENT record
Directed Edge	Set membership (owner→member)	DEPT→INSTRUCTOR link
Path	Navigation sequence through sets	DEPT→COURSE→ENROLLMENT→STUDENT
Cycle	Circular navigation possible	STUDENT→ENROLLMENT→COURSE→TA→STUDENT
Degree	Set participations (owner+member)	ENROLLMENT: in-degree=2, out-degree=0
Connected Component	All reachable records from a starting point	All data linked to a DEPARTMENT
DAG (Directed Acyclic Graph)	Network without cycles	Typical organizational hierarchies

Connectivity and Reachability:

One of the most significant graph properties for databases is reachability—which records can be accessed from a given starting point by following set links?

Definition: Record B is reachable from Record A if there exists a path:
  A → X₁ → X₂ → ... → Xₙ → B
  where each arrow represents traversing a set (in either direction: owner→member or member→owner)

In a well-designed network database:

All records should be reachable from at least one entry point
Entry points are often SYSTEM-owned sets or CALC-located record types
Unreachable records are effectively invisible to applications—a form of data loss
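
Reachability from a set of entry points is a standard breadth-first search. The Python sketch below treats each owner-member link as traversable in both directions, matching the definition above; the function name `reachable` and the sample record labels are illustrative assumptions.

```python
from collections import deque

def reachable(links, entry_points):
    """Return every record reachable from the entry points.

    links holds (owner, member) pairs; traversal is allowed both ways.
    """
    neighbours = {}
    for owner, member in links:
        neighbours.setdefault(owner, set()).add(member)
        neighbours.setdefault(member, set()).add(owner)
    seen = set(entry_points)
    queue = deque(entry_points)
    while queue:
        record = queue.popleft()
        for other in neighbours.get(record, ()):
            if other not in seen:
                seen.add(other)
                queue.append(other)
    return seen

# Hypothetical occurrences: "Orphan" is stored but connected to no set.
records = {"CS", "Chen", "CS101", "E1", "John", "Orphan"}
links = [("CS", "Chen"), ("CS", "CS101"), ("CS101", "E1"), ("John", "E1")]

found = reachable(links, ["CS"])
print(sorted(records - found))   # records no navigation path can reach
```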

Path Length and Access Efficiency:

The shortest path length between two records directly impacts query performance:

Path length 1: Direct set relationship, one traversal (fastest)
Path length 2: Two traversals through one intermediate record
Path length N: N set traversals through N-1 intermediate records

Network database designers optimize the schema to minimize path lengths for common query patterns while maintaining semantic accuracy.
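
Path lengths can be estimated at the schema level by treating record types as vertices and set types as edges. The Python sketch below does this for the four set types defined in the schema earlier on this page; the function name `path_length` is an assumption, and the count ignores how many member records each traversal actually visits.

```python
from collections import deque

SET_TYPES = {  # set name -> (owner record type, member record type)
    "DEPT_INSTRUCTOR": ("DEPARTMENT", "INSTRUCTOR"),
    "DEPT_COURSE": ("DEPARTMENT", "COURSE"),
    "COURSE_ENROLLMENT": ("COURSE", "ENROLLMENT"),
    "STUDENT_ENROLLMENT": ("STUDENT", "ENROLLMENT"),
}

def path_length(start, goal, set_types=SET_TYPES):
    """Fewest set traversals between two record types, or None if unconnected."""
    neighbours = {}
    for owner, member in set_types.values():
        neighbours.setdefault(owner, set()).add(member)
        neighbours.setdefault(member, set()).add(owner)
    distances = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if current == goal:
            return distances[current]
        for other in neighbours.get(current, ()):
            if other not in distances:
                distances[other] = distances[current] + 1
                queue.append(other)
    return None

print(path_length("DEPARTMENT", "INSTRUCTOR"))
print(path_length("STUDENT", "COURSE"))
print(path_length("DEPARTMENT", "STUDENT"))
```

A designer who found that department-to-student queries dominated the workload could use a calculation like this to judge whether an additional set type would shorten the common path.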

Schema Design = Graph Design

Network database schema design is fundamentally graph structure design. The goal is to create a graph where frequently-needed navigation paths are short and efficient, while accurately representing real-world entity relationships. This dual optimization—performance and correctness—defines the network database design challenge.

Advantages of the Graph-Based Architecture

The graph structure of the network model provided substantial advantages over hierarchical systems, addressing many real-world modeling challenges:

1. Natural Many-to-Many Representation:

The most significant advantage is native support for M:N relationships through intersection records with multiple set memberships. This maps naturally to:

Students enrolled in courses
Employees assigned to projects
Parts used in multiple products
Authors of multiple books

2. Elimination of Data Redundancy:

By allowing multiple parents (owner records), shared entities need only be stored once. A supplier used by multiple purchasing departments exists as a single record, linked via sets to each department—not copied into each.

Key Advantages

•Semantic Richness: Models complex real-world relationships that trees cannot represent without distortion.
•Data Integrity: Single-copy storage eliminates update anomalies from redundancy. Change once, reflected everywhere.
•Efficient Navigation: Pre-built pointer chains enable fast traversal—no joins needed at runtime for related data.
•Flexible Entry Points: Navigate the graph from any record type, not just from a designated root.
•Complex Query Paths: Traverse multiple relationship types in a single navigation sequence.
•Storage Efficiency: No redundant data copies means smaller database footprint for relationship-heavy domains.

When Networks Excel

Network structures shine in domains with: (1) Complex many-to-many relationships, (2) Frequent traversal-based queries following relationships, (3) Stable schema that won't require frequent restructuring, (4) High-volume transaction processing needing predictable performance.

The Tradeoffs

However, graph structures introduce: (1) Schema rigidity: changing relationships means restructuring pointers, (2) Navigation complexity: programmers must understand the full graph, (3) Pointer overhead: storage cost for link maintenance, (4) System dependency: pointer values are identifiers internal to one DBMS, so links cannot be carried to another system or storage layout without being rebuilt.

Summary: Graph Structure as Network Foundation

We've explored the fundamental architectural decision that distinguishes network databases from their hierarchical predecessors: the shift from tree structures to graph structures.

Key Takeaways

•Trees allow only single parentage; graphs allow records to have multiple parent relationships through different set types.
•Record types define entities (like tables), while set types define 1:N relationships between them. M:N relationships emerge from dual set memberships.
•Pointer chains physically implement the graph structure, with member records containing pointers for each set they belong to.
•Schema design in network databases is graph design—optimizing for short paths to frequently-accessed data while maintaining semantic accuracy.
•The graph structure eliminates hierarchical limitations but introduces complexity: navigation requires understanding the full relationship topology.
•Graph theory concepts—reachability, path length, cycles—directly apply to network database design and performance analysis.

What's Next:

Understanding the graph structure provides the foundation. Next, we'll explore CODASYL—the committee that standardized the network model, creating the specifications that governed commercial network database systems like IDMS, IDS II, and DMS-1100. We'll see how they formalized set operations, navigation primitives, and data manipulation in a comprehensive standard that shaped database technology for decades.

Page Complete

You now understand the graph-based architecture that defines the network data model. The ability to model many-to-many relationships through multiple set memberships was revolutionary, enabling database systems to faithfully represent complex real-world domains that hierarchies could not capture. Next, we'll examine how CODASYL standardized these concepts.

1 / 5

Loading learning content...

Database Management SystemsNetwork Model

The Network Data Model

LevelIntermediate

Duration60 mins

TopicNetwork Model

1 / 5

Graph Structure — The Network Data Model Foundation

Beyond the Tree: When Hierarchies Fall Short

The network model emerged as a direct response to these limitations, offering a fundamentally different organizing principle based on graph structures rather than tree structures.

Learning Objectives

The Mathematical Foundation: From Trees to Graphs

To understand why the network model represented a fundamental advancement, we must first establish the mathematical distinction between trees and graphs.

Tree Structure (Hierarchical Model):

A tree is a connected, acyclic graph in which:

There exists exactly one root node with no parent
Every non-root node has exactly one parent
There are no cycles—you cannot traverse from a node back to itself without backtracking
There is exactly one path between any two nodes

Graph Structure (Network Model):

A graph relaxes the tree constraints:

Multiple parents allowed: Any node can have zero, one, or many incoming edges
Cycles permitted: A path can lead back to its origin
Multiple paths possible: Different routes may connect the same pair of nodes
Richer connectivity: The relationship structure mirrors real-world complexity

Tree vs. Graph: Structural Properties
Property	Tree (Hierarchical)	Graph (Network)
Parent count per node	Exactly 1 (except root)	0, 1, or many
Cycles	Not allowed	Permitted
Paths between nodes	Exactly 1	Potentially many
Edge count for n nodes	Exactly n-1	0 to n² (directed)
Root requirement	Exactly one required	Not required
Relationship expressiveness	Limited (1:N only)	Full (M:N supported)
Navigation complexity	Simple—follow parent/child	Complex—multiple paths possible

The Modeling Implications:

This seemingly simple mathematical relaxation—allowing multiple parents—has profound implications for data modeling:

Many-to-Many Relationships: A student can be linked to multiple courses while each course remains linked to multiple students. Neither entity must be designated as the "owner."
Shared Subordinates: A component that appears in multiple products need not be duplicated. A single part record can be referenced by multiple parent assemblies.
Network Semantics: Real-world networks—social connections, transportation routes, supply chains—can be modeled naturally without artificial decomposition.
Reference vs. Containment: Rather than physically nesting data (containment), the network model allows referencing the same data from multiple contexts.

The Power of Multiple Parentage

Network Model Components: Records and Sets

Record Types (Nodes):

A record type in the network model is analogous to an entity type in the ER model or a table in the relational model. It defines a template for storing related data items.

Key characteristics:

Each record type has a name (e.g., STUDENT, COURSE, ENROLLMENT)
Each contains data items (fields)—similar to columns in relational databases
Record occurrences (instances) are the actual data entries—similar to rows
Each occurrence has a database key—a unique identifier assigned by the DBMS, similar to a physical address or pointer

network_record_definition.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
RECORD TYPE: STUDENT
DATA ITEMS:
    StudentID       : INTEGER
    Name            : CHARACTER(50)
    DateOfBirth     : DATE
    Major           : CHARACTER(30)
    GPA             : DECIMAL(3,2)
 
RECORD TYPE: COURSE
DATA ITEMS:
    CourseCode      : CHARACTER(10)
    Title           : CHARACTER(100)
    Credits         : INTEGER
    Department      : CHARACTER(30)
    MaxEnrollment   : INTEGER
 
RECORD TYPE: INSTRUCTOR
DATA ITEMS:
    InstructorID    : INTEGER
    Name            : CHARACTER(50)
    Department      : CHARACTER(30)
    OfficeLocation  : CHARACTER(20)

Set Types (Edges):

Set type components:

Owner record type: The "one" side of the 1:N relationship (the parent)
Member record type: The "many" side (the children)
Set name: An identifier for this relationship
Set occurrence: An actual instance—one owner record linked to zero or more member records

The term "set" is somewhat misleading mathematically—it's really an ordered collection (potentially with insertion order semantics) rather than an unordered mathematical set.

network_set_definition.txt

SET TYPE: DEPT_INSTRUCTOR
    OWNER:  DEPARTMENT
    MEMBER: INSTRUCTOR
    ORDER:  SORTED BY Instructor.Name
    -- Links each department to its instructors
 
SET TYPE: DEPT_COURSE
    OWNER:  DEPARTMENT
    MEMBER: COURSE
    ORDER:  FIRST (insert at beginning)
    -- Links each department to its courses
 
SET TYPE: COURSE_ENROLLMENT
    OWNER:  COURSE
    MEMBER: ENROLLMENT
    ORDER:  LAST (insert at end)
    -- Links each course to enrollment records
 
SET TYPE: STUDENT_ENROLLMENT
    OWNER:  STUDENT
    MEMBER: ENROLLMENT
    ORDER:  SORTED BY Enrollment.EnrollmentDate
    -- Links each student to enrollment records

Sets Enable the Network Structure

Visualizing the Network Graph Structure

The University Domain:

Consider these real-world facts:

A department has many instructors (1:N)
A department offers many courses (1:N)
An instructor can teach many courses (1:N, but potentially M:N)
A course can have many enrolled students (1:N from course perspective)
A student can enroll in many courses (1:N from student perspective)
Together, students and courses have an M:N enrollment relationship


Key Observations in the Diagram:

ENROLLMENT as an Intersection Record: The ENROLLMENT record type has two owners—it belongs to both the COURSE_ENROLLMENT set and the STUDENT_ENROLLMENT set. This is the network model's technique for representing M:N relationships.
Multiple Set Memberships: Notice how COURSE is both a member (of DEPT_COURSE and TEACHES) and an owner (of COURSE_ENROLLMENT). Records can simultaneously play both roles.
No Single Root: Unlike hierarchical structures, there's no designated root. The structure forms a directed graph that can be navigated from any entry point.
Cycle Potential: If instructors could also be students (e.g., graduate teaching assistants) and the schema linked the two record types, navigation could loop: STUDENT → ENROLLMENT → COURSE → INSTRUCTOR (via the TEACHES set) → back to STUDENT.

Hierarchical Limitation

•Student must belong to ONE parent
•Either: STUDENT under COURSE (lose student identity)
•Or: COURSE under STUDENT (lose course identity)
•Or: Duplicate student records under each course
•Result: Either data loss or redundancy

Network Solution

•ENROLLMENT record has TWO owners
•STUDENT owns its ENROLLMENT instances
•COURSE owns the same ENROLLMENT instances
•Each student and course stored exactly once
•Result: Full relationship fidelity, no redundancy
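
The contrast between the two layouts can be made concrete with a small Python sketch. It uses plain dictionaries and key lookups instead of physical pointers, and the course "MATH201", the course titles, and the helper function names are assumptions introduced only for illustration.

```python
# Hierarchical layout: the student's data is copied under every course.
hierarchical = {
    "CS101":   [{"student_id": 1, "name": "John Smith"}],
    "MATH201": [{"student_id": 1, "name": "John Smith"}],
}

# Network layout: each student and each course is stored once. Every
# ENROLLMENT record links to one STUDENT owner and one COURSE owner.
students = {1: {"name": "John Smith"}}
courses = {
    "CS101": {"title": "Intro to Programming"},
    "MATH201": {"title": "Linear Algebra"},
}
enrollments = [
    {"student": 1, "course": "CS101", "grade": "A"},
    {"student": 1, "course": "MATH201", "grade": None},
]


def copies_in_hierarchy(student_id):
    """Count how many times a student is stored in the hierarchical layout."""
    return sum(
        1
        for members in hierarchical.values()
        for s in members
        if s["student_id"] == student_id
    )


def courses_of(student_id):
    """Navigate from a student, through its enrollments, to its courses."""
    return [e["course"] for e in enrollments if e["student"] == student_id]


def students_in(course_code):
    """Navigate from a course, through its enrollments, to its students."""
    return [students[e["student"]]["name"] for e in enrollments if e["course"] == course_code]


# A name change is a single update in the network layout.
students[1]["name"] = "Jonathan Smith"
```

After the update, `students_in("CS101")` sees the new name, while the hierarchical copies still hold the old one until each copy is changed separately.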

Pointer Chains: How Links Are Implemented

Set Implementation via Linked Lists:

Each set occurrence (an owner with its members) is typically implemented as a circular linked list:

Owner Pointer Chain: The owner record contains a pointer to its first member
Member Chain: Each member record points to the next member in the set
Circular Return: The last member points back to the owner, completing the circle
Back Pointers (Optional): Members may also contain a pointer back to the owner for efficient owner retrieval

pointer_chain_structure.txt
SET OCCURRENCE EXAMPLE: DEPT_INSTRUCTOR set for "Computer Science" department
 
┌──────────────────────────────────────────────────────────────┐
│  OWNER RECORD: DEPARTMENT "Computer Science"                 │
│    DeptID: "CS"                                              │
│    Name: "Computer Science"                                  │
│    Building: "Engineering Hall"                              │
│    FIRST_MEMBER_PTR: → MEMBER 1                              │
└──────────────────────────────────────────────────────────────┘
        │
        ▼
┌──────────────────────────────────────────────────────────────┐
│  MEMBER 1: INSTRUCTOR "Dr. Alice Chen"                       │
│    InstID: 101                                               │
│    Name: "Dr. Alice Chen"                                    │
│    Office: "EH-301"                                          │
│    OWNER_PTR: → (points back to DEPARTMENT "CS")             │
│    NEXT_MEMBER_PTR: → MEMBER 2                               │
└──────────────────────────────────────────────────────────────┘
        │
        ▼
┌──────────────────────────────────────────────────────────────┐
│  MEMBER 2: INSTRUCTOR "Dr. Bob Martinez"                     │
│    InstID: 102                                               │
│    Name: "Dr. Bob Martinez"                                  │
│    Office: "EH-305"                                          │
│    OWNER_PTR: → (points back to DEPARTMENT "CS")             │
│    NEXT_MEMBER_PTR: → (points to next member or owner)       │
└──────────────────────────────────────────────────────────────┘
        │
        │  ... more members ...
        │
        └──→ (circular: last member's NEXT_PTR → back to owner)
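
The circular chain in this example can be sketched as a small linked structure in Python. The `Record` class, the `is_owner` flag, and the helper names are assumptions made for the sketch; a real system would store disk addresses rather than object references.

```python
class Record:
    """A stored record taking part in one set occurrence (hypothetical layout)."""

    def __init__(self, name, is_owner=False):
        self.name = name
        self.is_owner = is_owner
        self.next = self if is_owner else None  # an empty ring points at the owner itself
        self.owner = None                       # optional back pointer to the owner


def connect_last(owner, member):
    """Append a member at the end of the owner's ring."""
    cursor = owner
    while cursor.next is not owner:   # walk to the current last record
        cursor = cursor.next
    cursor.next = member
    member.next = owner               # circular return to the owner
    member.owner = owner              # back pointer for direct owner lookup


def members(owner):
    """Yield the members of a set occurrence in chain order."""
    cursor = owner.next
    while cursor is not owner:
        yield cursor
        cursor = cursor.next


def find_owner_by_ring(member):
    """Without a back pointer, follow NEXT pointers until the owner is reached."""
    cursor, hops = member.next, 1
    while not cursor.is_owner:
        cursor, hops = cursor.next, hops + 1
    return cursor, hops


cs = Record("Computer Science", is_owner=True)
alice = Record("Dr. Alice Chen")
bob = Record("Dr. Bob Martinez")
connect_last(cs, alice)
connect_last(cs, bob)
```

With the back pointer, `alice.owner` reaches the department in one step; `find_owner_by_ring(alice)` needs two hops through the ring, which suggests why back pointers are offered as an optional speed-up for owner retrieval.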

Dual Set Membership Illustrated:

dual_set_membership.txt
ENROLLMENT RECORD with DUAL SET MEMBERSHIP:
 
┌──────────────────────────────────────────────────────────────────────┐
│  ENROLLMENT RECORD                                                    │
│  ┌────────────────────────────────────────────────────────────────┐  │
│  │ EnrollmentID: 50001                                            │  │
│  │ Grade: "A"                                                     │  │
│  │ Semester: "Fall 2024"                                          │  │
│  │ EnrollmentDate: 2024-08-15                                     │  │
│  │                                                                 │  │
│  │ // COURSE_ENROLLMENT set pointers                              │  │
│  │ COURSE_OWNER_PTR: → COURSE "CS101"                             │  │
│  │ COURSE_NEXT_PTR: → next ENROLLMENT in CS101                    │  │
│  │ COURSE_PREV_PTR: → prev ENROLLMENT in CS101  (if doubly-linked)│  │
│  │                                                                 │  │
│  │ // STUDENT_ENROLLMENT set pointers                             │  │
│  │ STUDENT_OWNER_PTR: → STUDENT "John Smith"                      │  │
│  │ STUDENT_NEXT_PTR: → next ENROLLMENT for John                   │  │
│  │ STUDENT_PREV_PTR: → prev ENROLLMENT for John (if doubly-linked)│  │
│  └────────────────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────────────────┘
 
This single ENROLLMENT record is simultaneously:
  - A member of the CS101 course's enrollment set (linked with other CS101 enrollments)
  - A member of John Smith's enrollment set (linked with John's other course enrollments)
 
Traversal from COURSE "CS101":
  CS101 → FIRST_MEMBER → ENROLL_50001 → NEXT → ENROLL_50002 → ... → (back to CS101)
 
Traversal from STUDENT "John Smith":
  John → FIRST_MEMBER → ENROLL_50001 → NEXT → ENROLL_50007 → ... → (back to John)
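
The two traversals above can be reproduced with a short Python sketch in which every record keeps one NEXT pointer and one owner pointer per set it takes part in. The `Rec` class, the `connect` and `walk` helpers, the student "Mary Jones", and the course "MATH201" are assumptions for illustration only.

```python
class Rec:
    """A record with separate ring pointers for each set it participates in."""

    def __init__(self, label):
        self.label = label
        self.next = {}    # set name -> next record in that ring
        self.owner = {}   # set name -> owner record (back pointer)


def connect(set_name, owner, member):
    """Append member to the end of owner's ring for the given set."""
    owner.next.setdefault(set_name, owner)        # empty ring points at the owner
    cursor = owner
    while cursor.next[set_name] is not owner:
        cursor = cursor.next[set_name]
    cursor.next[set_name] = member
    member.next[set_name] = owner
    member.owner[set_name] = owner


def walk(set_name, owner):
    """Yield the members of one set occurrence in chain order."""
    cursor = owner.next.get(set_name, owner)
    while cursor is not owner:
        yield cursor
        cursor = cursor.next[set_name]


cs101, math201 = Rec("CS101"), Rec("MATH201")
john, mary = Rec("John Smith"), Rec("Mary Jones")

# Each enrollment is connected into two rings: one per owner.
for label, student, course in [
    ("ENROLL_50001", john, cs101),
    ("ENROLL_50002", mary, cs101),
    ("ENROLL_50007", john, math201),
]:
    enrollment = Rec(label)
    connect("COURSE_ENROLLMENT", course, enrollment)
    connect("STUDENT_ENROLLMENT", student, enrollment)


def students_in(course):
    """Walk the course's ring, then hop to each enrollment's student owner."""
    return [e.owner["STUDENT_ENROLLMENT"].label for e in walk("COURSE_ENROLLMENT", course)]


def courses_of(student):
    """Walk the student's ring, then hop to each enrollment's course owner."""
    return [e.owner["COURSE_ENROLLMENT"].label for e in walk("STUDENT_ENROLLMENT", student)]
```

Answering an M:N question therefore takes two steps per result: follow one ring to an ENROLLMENT record, then follow that record's pointer for the other set to reach the second owner.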

The Pointer Overhead

Formal Schema Definition in Network Databases

Schema Components:

Schema Name: Identifies the entire database schema
Record Definitions: Each record type with its data items
Set Definitions: Each relationship with owner, member, and ordering specifications
Integrity Constraints: Rules governing set membership and data validity

university_network_schema.ddl
SCHEMA NAME IS UNIVERSITY_DB
 
RECORD NAME IS DEPARTMENT
    LOCATION MODE IS CALC USING DeptID
    DUPLICATES ARE NOT ALLOWED
    02 DeptID         TYPE IS CHARACTER 10
    02 DeptName       TYPE IS CHARACTER 50
    02 Building       TYPE IS CHARACTER 30
    02 Budget         TYPE IS DECIMAL 12
 
RECORD NAME IS INSTRUCTOR
    LOCATION MODE IS VIA DEPT_INSTRUCTOR SET
    02 InstructorID   TYPE IS DECIMAL 8
    02 Name           TYPE IS CHARACTER 50
    02 HireDate       TYPE IS DATE
    02 Salary         TYPE IS DECIMAL 10
    02 Office         TYPE IS CHARACTER 20
 
RECORD NAME IS COURSE
    LOCATION MODE IS CALC USING CourseCode
    02 CourseCode     TYPE IS CHARACTER 10
    02 Title          TYPE IS CHARACTER 100
    02 Credits        TYPE IS DECIMAL 2
    02 Description    TYPE IS CHARACTER 500
 
RECORD NAME IS STUDENT
    LOCATION MODE IS CALC USING StudentID
    02 StudentID      TYPE IS DECIMAL 10
    02 Name           TYPE IS CHARACTER 50
    02 DateOfBirth    TYPE IS DATE
    02 Major          TYPE IS CHARACTER 30
    02 GPA            TYPE IS DECIMAL 3
 
RECORD NAME IS ENROLLMENT
    LOCATION MODE IS VIA COURSE_ENROLLMENT SET
    02 EnrollmentDate TYPE IS DATE
    02 Grade          TYPE IS CHARACTER 2
    02 Semester       TYPE IS CHARACTER 20
 
SET NAME IS DEPT_INSTRUCTOR
    OWNER IS DEPARTMENT
    ORDER IS SORTED BY DEFINED KEYS
    MEMBER IS INSTRUCTOR
        KEY IS Name ASCENDING
        INSERTION IS AUTOMATIC
        RETENTION IS MANDATORY
        SET SELECTION IS THRU CURRENT OF DEPT_INSTRUCTOR
 
SET NAME IS DEPT_COURSE
    OWNER IS DEPARTMENT
    ORDER IS SORTED BY DEFINED KEYS
    MEMBER IS COURSE
        KEY IS CourseCode ASCENDING
        INSERTION IS MANUAL
        RETENTION IS OPTIONAL
        SET SELECTION IS BY APPLICATION
 
SET NAME IS COURSE_ENROLLMENT
    OWNER IS COURSE
    ORDER IS LAST
    MEMBER IS ENROLLMENT
        INSERTION IS AUTOMATIC
        RETENTION IS FIXED
        SET SELECTION IS STRUCTURAL
 
SET NAME IS STUDENT_ENROLLMENT
    OWNER IS STUDENT
    ORDER IS SORTED BY DEFINED KEYS
    MEMBER IS ENROLLMENT
        KEY IS EnrollmentDate ASCENDING
        INSERTION IS AUTOMATIC
        RETENTION IS FIXED
        SET SELECTION IS STRUCTURAL

Key Schema Concepts

•LOCATION MODE: How records are physically stored. CALC uses hashing on key field; VIA stores members near their owner for efficient traversal.
•INSERTION IS AUTOMATIC/MANUAL: Whether member records are automatically added to sets or require explicit application code.
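•RETENTION IS MANDATORY/OPTIONAL/FIXED: How firmly a member is bound to a set. MANDATORY members must always belong to some occurrence of the set but can be moved to a different owner; OPTIONAL members can be removed from the set entirely; FIXED members stay in the occurrence they were first inserted into until the record is deleted.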
•ORDER IS SORTED/FIRST/LAST: How members are ordered within a set occurrence—sorted by key, inserted at front, or appended at end.
•SET SELECTION: How the DBMS determines which set occurrence to use when inserting—current position, application-specified, or structurally determined.
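
The retention options can be summarized as a lookup of which operations are permitted on a member. The Python sketch below is a simplified reading of those rules, assuming two hypothetical operations, "disconnect" (remove a member from the set) and "reconnect" (move it to another owner); the retention values are copied from the schema listing.

```python
# Which operations each RETENTION option permits (simplified model).
RULES = {
    "OPTIONAL":  {"disconnect": True,  "reconnect": True},
    "MANDATORY": {"disconnect": False, "reconnect": True},
    "FIXED":     {"disconnect": False, "reconnect": False},
}

# Retention declared for each set in the UNIVERSITY_DB schema.
SCHEMA_RETENTION = {
    "DEPT_INSTRUCTOR": "MANDATORY",
    "DEPT_COURSE": "OPTIONAL",
    "COURSE_ENROLLMENT": "FIXED",
    "STUDENT_ENROLLMENT": "FIXED",
}


def is_allowed(set_name, operation):
    """Return True if the operation is permitted on members of the given set."""
    return RULES[SCHEMA_RETENTION[set_name]][operation]
```

Under this reading, an instructor can be moved to another department but never left without one, a course can be detached from its department, and an enrollment stays with its original course and student for its whole lifetime.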

Graph Theory Concepts in the Network Model

Directed Graphs (Digraphs):

Network database structures form directed graphs—graphs where edges have direction (from owner to member). Key properties:

Vertices (Nodes): Record occurrences in the database
Directed Edges (Arcs): Set membership links from owner to member
In-Degree: Number of set occurrences in which a record is a member, i.e. how many owners it has (multiple parentage)
Out-Degree: Number of member records a record owns across all of its set occurrences (children count)

Graph Theory Terminology in Network Databases
Graph Theory Term	Network Database Equivalent	Example
Vertex/Node	Record occurrence	A specific STUDENT record
Directed Edge	Set membership (owner→member)	DEPT→INSTRUCTOR link
Path	Navigation sequence through sets	DEPT→COURSE→ENROLLMENT→STUDENT
Cycle	Circular navigation possible	STUDENT→ENROLLMENT→COURSE→TA→STUDENT
Degree	Set participations (owner+member)	ENROLLMENT: in-degree=2, out-degree=0
Connected Component	All reachable records from a starting point	All data linked to a DEPARTMENT
DAG (Directed Acyclic Graph)	Network without cycles	Typical organizational hierarchies
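
In-degree and out-degree can be computed directly from a list of owner→member links. The Python sketch below assumes a hypothetical handful of links taken from the running example; it is a small illustration rather than a DBMS facility.

```python
from collections import Counter

# Each pair is one directed edge: (owner record, member record).
edges = [
    ("CS", "Dr. Alice Chen"),
    ("CS", "Dr. Bob Martinez"),
    ("CS", "CS101"),
    ("CS101", "ENROLL_50001"),
    ("John Smith", "ENROLL_50001"),
]

out_degree = Counter(owner for owner, _ in edges)    # members owned by each record
in_degree = Counter(member for _, member in edges)   # owners of each record
```

Here `in_degree["ENROLL_50001"]` is 2 and `out_degree["ENROLL_50001"]` is 0, matching the ENROLLMENT row of the table; a `Counter` returns 0 for records that never appear on that side of an edge.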

Connectivity and Reachability:

One of the most significant graph properties for databases is reachability—which records can be accessed from a given starting point by following set links?

Definition: Record B is reachable from Record A if there exists a path:
  A → X₁ → X₂ → ... → Xₙ → B
  where each arrow represents traversing a set (in either direction: owner→member or member→owner)

In a well-designed network database:

All records should be reachable from at least one entry point
Entry points are often SYSTEM-owned sets or CALC-located record types
Unreachable records are effectively invisible to applications—a form of data loss
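
Reachability as defined above can be checked with a breadth-first search that follows set links in both directions. The Python sketch below assumes a hypothetical link list and an orphaned record "ENROLL_59999" that was never connected to any set.

```python
from collections import deque


def reachable(links, entry_points):
    """Return every record reachable from the entry points via set links."""
    neighbours = {}
    for owner, member in links:
        # Sets can be traversed owner→member and member→owner.
        neighbours.setdefault(owner, set()).add(member)
        neighbours.setdefault(member, set()).add(owner)

    seen = set(entry_points)
    queue = deque(entry_points)
    while queue:
        record = queue.popleft()
        for other in neighbours.get(record, ()):
            if other not in seen:
                seen.add(other)
                queue.append(other)
    return seen


links = [
    ("CS", "CS101"),
    ("CS101", "ENROLL_50001"),
    ("John Smith", "ENROLL_50001"),
]
all_records = {"CS", "CS101", "ENROLL_50001", "John Smith", "ENROLL_59999"}

unreachable = all_records - reachable(links, ["CS"])
```

The difference between all stored records and the reachable ones identifies records that no navigation path can find from the chosen entry point.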

Path Length and Access Efficiency:

The shortest path length between two records directly impacts query performance:

Path length 1: Direct set relationship, one set traversal (fastest)
Path length 2: Two set traversals through one intermediate record
Path length N: N set traversals through N-1 intermediate records

Network database designers optimize the schema to minimize path lengths for common query patterns while maintaining semantic accuracy.
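
Path length between two records can be estimated with a breadth-first search that counts set traversals. The Python sketch below assumes a hypothetical link list from the running example and treats every traversal as equally expensive, which real systems may not.

```python
from collections import deque


def path_length(links, start, goal):
    """Return the fewest set traversals from start to goal, or None if unreachable."""
    neighbours = {}
    for owner, member in links:
        neighbours.setdefault(owner, set()).add(member)
        neighbours.setdefault(member, set()).add(owner)

    distance = {start: 0}
    queue = deque([start])
    while queue:
        record = queue.popleft()
        if record == goal:
            return distance[record]
        for other in neighbours.get(record, ()):
            if other not in distance:
                distance[other] = distance[record] + 1
                queue.append(other)
    return None


links = [
    ("CS", "Dr. Alice Chen"),
    ("CS", "CS101"),
    ("CS101", "ENROLL_50001"),
    ("John Smith", "ENROLL_50001"),
]
```

With these links, the department reaches an instructor in one traversal, while reaching a student from the department takes three (DEPARTMENT → COURSE → ENROLLMENT → STUDENT). A designer who needs that query often might consider a more direct set between the two record types.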

Schema Design = Graph Design

Advantages of the Graph-Based Architecture

The graph structure of the network model provided substantial advantages over hierarchical systems, addressing many real-world modeling challenges:

1. Natural Many-to-Many Representation:

The most significant advantage is native support for M:N relationships through intersection records with multiple set memberships. This maps naturally to:

Students enrolled in courses
Employees assigned to projects
Parts used in multiple products
Authors of multiple books

2. Elimination of Data Redundancy:

Key Advantages

•Semantic Richness: Models complex real-world relationships that trees cannot represent without distortion.
•Data Integrity: Single-copy storage eliminates update anomalies from redundancy. Change once, reflected everywhere.
•Efficient Navigation: Pre-built pointer chains enable fast traversal—no joins needed at runtime for related data.
•Flexible Entry Points: Navigate the graph from any record type, not just from a designated root.
•Complex Query Paths: Traverse multiple relationship types in a single navigation sequence.
•Storage Efficiency: No redundant data copies means smaller database footprint for relationship-heavy domains.

When Networks Excel

The Tradeoffs

Summary: Graph Structure as Network Foundation

We've explored the fundamental architectural decision that distinguishes network databases from their hierarchical predecessors: the shift from tree structures to graph structures.

Key Takeaways

•Trees allow only single parentage; graphs allow records to have multiple parent relationships through different set types.
•Record types define entities (like tables), while set types define 1:N relationships between them. M:N relationships emerge from dual set memberships.
•Pointer chains physically implement the graph structure, with member records containing pointers for each set they belong to.
•Schema design in network databases is graph design—optimizing for short paths to frequently-accessed data while maintaining semantic accuracy.
•The graph structure eliminates hierarchical limitations but introduces complexity: navigation requires understanding the full relationship topology.
•Graph theory concepts—reachability, path length, cycles—directly apply to network database design and performance analysis.

What's Next:
