Database Management SystemsNetwork Model

The Network Data Model

LevelIntermediate

Duration60 mins

TopicNetwork Model

5 / 5

Comparison with Hierarchical Model — A Tale of Two Paradigms

Two Paths from the Same Origin

In the 1960s, as organizations began computerizing their data management, two competing paradigms emerged for structuring database systems. IBM's IMS championed the hierarchical model, organizing data as trees with strict parent-child relationships. GE's IDS and the subsequent CODASYL standard developed the network model, allowing more flexible graph structures.

These weren't academic differences—they represented fundamentally different philosophies about how data should be organized and accessed. For nearly two decades, organizations made strategic bets on one paradigm or the other, building massive applications that would run for decades.

Understanding the comparison between these models isn't merely historical interest. It illuminates the nature of data modeling trade-offs that persist today: simplicity versus flexibility, performance versus generality, and the eternal tension between constraint and expressiveness.

Learning Objectives

By the end of this page, you will understand: (1) The fundamental structural differences between trees and graphs, (2) How each model handles different relationship types (1:1, 1:N, M:N), (3) Performance characteristics and when each model excels, (4) Data redundancy implications, (5) The programming complexity of each approach, (6) Why both ultimately gave way to the relational model.

Structural Foundations: Trees vs. Graphs

The most fundamental difference lies in the underlying graph structures:

Hierarchical Model (Tree Structure):

Single root: Each database area has exactly one root record type
Single parent constraint: Every non-root record has exactly one parent
No cycles: The structure forms a Directed Acyclic Graph (specifically, a tree)
Implicit relationships: Parent-child links are the only relationships
Top-down access: Navigation primarily flows from root toward leaves

Network Model (Graph Structure):

Multiple entry points: Any record type can serve as a starting point
Multiple parents allowed: A record can be a member of multiple set types
Cycles permitted: The graph can contain circular paths
Named relationships: Set types explicitly define and name relationships
Multi-directional access: Navigate up, down, or across the graph

Converting Mermaid diagram...

Structural Property Comparison
Property	Hierarchical (Tree)	Network (Graph)
Mathematical structure	Rooted tree (arborescence)	Directed graph (may have cycles)
Parent count per node	Exactly 1 (except root: 0)	0, 1, or many (via different sets)
Paths between nodes	Exactly 1	Potentially many
Root requirement	Mandatory—one per hierarchy	None—any record can be an entry point
Relationship cardinality	1:N only (implicit)	1:N per set, M:N via intersection records
Navigation direction	Primarily parent→child	Any direction: owner→member, member→owner, across sets

The Single-Parent Constraint

The hierarchical model's single-parent constraint is both its defining characteristic and its primary limitation. It simplifies navigation (there's only one path to any record from the root), but it cannot naturally represent real-world scenarios where an entity 'belongs to' multiple parents.

Relationship Modeling Capabilities

The most significant practical difference between the models is how they handle various relationship types.

One-to-One (1:1) Relationships:

Both models handle 1:1 relationships adequately:

Hierarchical: Child segment under parent, with single occurrence
Network: Set with at most one member per owner occurrence

One-to-Many (1:N) Relationships:

Both models excel at 1:N—this is their natural structure:

Hierarchical: Parent with multiple children (department → employees)
Network: Owner with multiple members (department owns employee set)

Many-to-Many (M:N) Relationships:

This is where the models diverge dramatically.

Hierarchical: M:N Workarounds

The hierarchical model cannot directly represent M:N. Workarounds include:

Virtual Pairing: Create intersection segments under each parent (data duplication)

Logical Pointers: Store key values that reference records elsewhere (loses navigational performance)

Hierarchy Duplication: Repeat entire hierarchies from different perspectives

All workarounds have significant drawbacks.

Network: M:N Native Support

The network model handles M:N naturally:

Intersection Records: Create a record type that is a member of sets from both parents

Dual Ownership: The intersection record has two owners via different sets

No Duplication: Each entity appears once; relationships connect them

M:N is expressed without data redundancy.

many_to_many_comparison.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
/* ================================================================
   EXAMPLE: Students enrolled in Courses (M:N Relationship)
   - Each student can enroll in many courses
   - Each course can have many enrolled students
   ================================================================ */
 
/* HIERARCHICAL MODEL APPROACH */
 
Database Schema (IMS-style):
    COURSE (root)
        └── ENROLLMENT (child with pointer to STUDENT)
            Contains: STUDENT-KEY (logical child pointer)
            
    STUDENT (separate root in another hierarchy)
        └── S-ENROLLMENT (virtual segment)
            Physical twin of ENROLLMENT under COURSE
            
/* Problems:
   - ENROLLMENT data may be duplicated
   - Must maintain both paths in sync
   - Queries from student perspective require different access path
   - Complex pointer management */
 
 
/* NETWORK MODEL APPROACH */
 
    COURSE ──[COURSE_ENROLLMENT SET]──> ENROLLMENT
    STUDENT ──[STUDENT_ENROLLMENT SET]──> ENROLLMENT
    
    ENROLLMENT is a member of TWO sets:
    - COURSE_ENROLLMENT: owned by COURSE
    - STUDENT_ENROLLMENT: owned by STUDENT
 
/* Benefits:
   - Single ENROLLMENT record per student-course pair
   - Navigate from COURSE through ENROLLMENT to retrieve students
   - Navigate from STUDENT through ENROLLMENT to retrieve courses
   - No data duplication
   - Pointer chains maintained by DBMS */
 
/* Navigation from COURSE to get all enrolled STUDENTS: */
 
FIND ANY COURSE USING CourseCode = "CS101".
FIND FIRST ENROLLMENT WITHIN COURSE_ENROLLMENT.
PERFORM UNTIL end-of-set
    -- Navigate via other set to get student
    FIND OWNER WITHIN STUDENT_ENROLLMENT.
    GET STUDENT.
    DISPLAY StudentName.
    -- Return and continue
    FIND NEXT ENROLLMENT WITHIN COURSE_ENROLLMENT.
END-PERFORM.

IMS Logical Relationships

IBM enhanced IMS with 'logical relationships' to address M:N limitations. These allowed segments to have 'logical parents' in addition to their physical parent. While this extended capabilities, it added complexity and still required careful design to avoid data inconsistency. The network model's approach was inherently cleaner for M:N scenarios.

Data Redundancy and Consistency

The structural constraints of each model have profound implications for data redundancy and consistency maintenance.

Hierarchical Model Redundancy:

Because each record can have only one parent, data that belongs to multiple contexts must often be duplicated:

hierarchical_redundancy.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
/* Example: Supplier information in a manufacturing database */
 
/* Hierarchical approach - redundancy required: */
 
HIERARCHY 1: PRODUCT-focused
    PRODUCT
        └── PRODUCT_COMPONENT
            └── COMPONENT_SUPPLIER  // Contains supplier details
                - SupplierName
                - SupplierAddress
                - SupplierRating
 
HIERARCHY 2: SUPPLIER-focused  
    SUPPLIER  // Full supplier record
        └── SUPPLIER_PRODUCT  // Products they supply
 
/* Problem: If Supplier "Acme Corp" supplies 50 products,
   their name/address/rating appears in:
   - Once in SUPPLIER hierarchy
   - 50 times in COMPONENT_SUPPLIER segments under each product
   
   If Acme Corp moves to a new address:
   - Must update SUPPLIER record: 1 update
   - Must update 50 COMPONENT_SUPPLIER records: 50 updates
   - Risk: Updates may be missed → data inconsistency
*/

Network Model: Reduced Redundancy

The network model's ability to share records across multiple owners eliminates most structural redundancy:

network_no_redundancy.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
/* Network approach - no redundancy: */
 
SUPPLIER record (stored ONCE):
    - SupplierID
    - SupplierName
    - SupplierAddress
    - SupplierRating
 
PRODUCT record (stored ONCE):
    - ProductID
    - ProductName
 
SUPPLY record (intersection, one per relationship):
    - Member of SUPPLIER_SUPPLY set (owner: SUPPLIER)
    - Member of PRODUCT_SUPPLY set (owner: PRODUCT)
    - Contains: Price, LeadTime, Quantity available
 
/* If Supplier "Acme Corp" supplies 50 products:
   - 1 SUPPLIER record for Acme Corp
   - 50 SUPPLY records linking Acme to products
   - Supplier details appear ONCE
   
   If Acme Corp moves to a new address:
   - Update 1 SUPPLIER record
   - All 50 product relationships automatically reflect correct info
   - No risk of inconsistency
*/

Redundancy Implications
Factor	Hierarchical Impact	Network Impact
Storage efficiency	Lower—duplicated data consumes space	Higher—shared records, less duplication
Update operations	Multiple updates for one logical change	Single update propagates via pointers
Consistency risk	High—missed updates cause inconsistency	Low—one source of truth
Delete complexity	Must delete all copies	Delete once, set membership removed
Application burden	Must coordinate multi-point updates	DBMS maintains relationship integrity

Performance Characteristics and Trade-offs

Both models deliver excellent performance for their intended workloads, but their performance profiles differ significantly.

Hierarchical Model Performance Advantages:

Simpler Navigation: Only one path exists from root to any record. No ambiguity, no choice overhead.
Physical Clustering: Related segments can be stored contiguously on disk (adjacent in a HIDAM or HDAM structure), improving I/O for tree traversals.
Predictable Access Patterns: Top-down access aligns with physical storage, yielding consistent performance.
Less Pointer Overhead: Each record has only one parent pointer (or implicit adjacency), reducing storage overhead.

hierarchical_performance.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
/* Hierarchical performance scenario: Bill of Materials */
 
/* Navigate from Product to all components (tree traversal) */
 
Product: "Bicycle"
    ├── Component: "Frame"
    │       ├── Subcomponent: "Steel Tube"
    │       └── Subcomponent: "Welded Joint"
    ├── Component: "Wheels"
    │       ├── Subcomponent: "Tire"
    │       └── Subcomponent: "Rim"
    └── Component: "Handlebars"
 
/* Access Pattern:
   GU (Get Unique) PRODUCT WHERE ProductID = 'Bicycle'
   GNP (Get Next within Parent) to iterate children
   
   Physical Storage: All segments stored contiguously
   I/O Pattern: Sequential read through related data
   Performance: Excellent - minimal disk seeks
 
   This is the IDEAL use case for hierarchical:
   - True tree structure
   - Access always from root downward
   - Components genuinely "belong" to products
*/

Network Model Performance Advantages:

Multiple Access Paths: Can reach data from any direction; application chooses optimal path for each query.
M:N Traversal Efficiency: Following pointers through intersection records is faster than IMS logical relationships.
Bidirectional Navigation: Can traverse from member to owner without searching; owner pointer is explicit.
Set Indexing: Some implementations allow indexed access within sets, further optimizing member lookup.

network_performance.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
/* Network performance scenario: Multi-path queries */
 
/* Query: Find all projects for employee E1001 */
 
-- Path 1: Start from Employee
FIND ANY EMPLOYEE USING EmployeeID = "E1001".  -- O(1) via CALC
FIND FIRST ASSIGNMENT WITHIN EMP_ASSIGNMENTS.
PERFORM navigate through assignments to get projects...
 
/* Query: Find all employees on project P100 */
 
-- Path 2: Start from Project  
FIND ANY PROJECT USING ProjectCode = "P100".  -- O(1) via CALC
FIND FIRST ASSIGNMENT WITHIN PROJECT_ASSIGNMENTS.
PERFORM navigate through assignments to get employees...
 
/* Both queries are efficient!
   The same ASSIGNMENT records are accessed via different sets.
   No duplication, no complex logical relationships.
   
   In hierarchical model, one direction would be fast,
   the other would require searching or secondary indexes.
*/

Performance Profile Comparison
Access Pattern	Hierarchical	Network
Root-to-leaf traversal	Excellent (optimal case)	Good
Leaf-to-root navigation	Poor (requires search or secondary access)	Excellent (follow owner pointer)
M:N traversal	Poor (complex logical relationships)	Excellent (direct pointer chains)
Multi-path queries	Difficult (may need separate hierarchies)	Natural (navigate via different sets)
Sequential scan of type	Good (segment type search)	Good (system-owned sets)
Storage overhead	Lower (simpler pointers)	Higher (multiple set pointers per record)

Programming Complexity and Developer Experience

Both models require explicit navigational programming, but the complexity differs based on structural characteristics.

Hierarchical (IMS) Programming

•Single path simplicity: Only one way to reach any segment
•Segment Search Arguments (SSA): Specify search criteria per segment level
•GU/GN/GNP calls: Intuitive Get Unique, Get Next hierarchy
•Implied currency: Parent-child relationships are implicit in program flow
•Limited flexibility: Must follow hierarchy even when not natural for query

Network (CODASYL) Programming

•Multiple path flexibility: Many ways to reach any record
•Set-based navigation: FIND FIRST/NEXT/OWNER within specific sets
•Explicit currency: Must understand and manage currency indicators
•More DML verbs: FIND, GET, CONNECT, DISCONNECT, RECONNECT, etc.
•Greater power/complexity: More operations = more to learn and get wrong

programming_comparison.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
/* ================================================================
   SAME QUERY: List all employees in department "Engineering"
   ================================================================ */
 
/* IMS HIERARCHICAL (DL/I calls): */
 
MOVE 'ENGINEERING' TO DEPT-NAME-SSA.
CALL 'CBLTDLI' USING GU, PCB, DEPT-IO-AREA, DEPT-SSA.
IF STATUS-CODE = SPACES
    CALL 'CBLTDLI' USING GNP, PCB, EMP-IO-AREA.
    PERFORM UNTIL STATUS-CODE = 'GE'  -- No more children
        DISPLAY EMP-NAME
        CALL 'CBLTDLI' USING GNP, PCB, EMP-IO-AREA
    END-PERFORM.
END-IF.
 
/* Explanation:
   - GU: Get Unique root segment matching SSA
   - GNP: Get Next segment within Parent
   - 'GE' status: End of parent's children
   - Navigation is strictly parent→child
*/
 
 
/* CODASYL NETWORK: */
 
FIND ANY DEPARTMENT USING DeptName = "ENGINEERING".
IF DB-STATUS = "0000000"
    FIND FIRST EMPLOYEE WITHIN DEPT_EMPLOYEE.
    PERFORM UNTIL DB-STATUS = "0502100"  -- End of set
        GET EMPLOYEE.
        DISPLAY EMP-NAME IN EMPLOYEE.
        FIND NEXT EMPLOYEE WITHIN DEPT_EMPLOYEE.
    END-PERFORM.
END-IF.
 
/* Explanation:
   - FIND ANY: Direct access via CALC key
   - FIND FIRST/NEXT: Set traversal
   - GET: Retrieve data (separate from FIND)
   - Can navigate any direction from here
*/
 
 
/* Complexity comparison for THIS query:
   - Both are fairly straightforward
   - IMS is slightly more compact (combined search arguments)
   - CODASYL has clearer separation of FIND vs GET
   - Similar cognitive load for simple hierarchy traversal
*/

Where Complexity Diverges:

For simple hierarchical queries, both models are manageable. The differences emerge with:

M:N Relationships: CODASYL handles naturally; IMS requires complex logical relationship programming
Multi-directional Queries: CODASYL navigates freely; IMS requires separate access paths or secondary processing
Currency Management: CODASYL's multiple currency types require more careful state management than IMS's simpler parent-child position
Schema Evolution: Changing structure in IMS often requires application rewrites; CODASYL set changes can sometimes be isolated

Neither Was Simple

Both models required programmers to understand database structure intimately and code explicit navigation. Both had steep learning curves. The key difference: IMS was simpler for pure hierarchical data, while CODASYL was more capable for complex relationships. Neither approached the ease of SQL declarative queries.

Use Case Alignment: Where Each Model Excels

Each model aligns better with certain data patterns and application requirements.

Hierarchical Model Ideal Use Cases:

Hierarchical Strengths

•Bill of Materials (BOM): Products → Assemblies → Components → Sub-components. True tree structure; navigation always from product down.
•Organizational Charts: Company → Divisions → Departments → Teams → Employees. Clear reporting hierarchy.
•Document Management: Library → Category → Document → Version. Containment relationships.
•File Systems: Directory → Subdirectory → File. Classic tree structure.
•Parsing/Syntax Trees: Expression trees, XML/JSON document structures. Naturally hierarchical.

Network Model Ideal Use Cases:

Network Strengths

•Student/Course Enrollment: Students ↔ Courses through Enrollments. Classic M:N relationship.
•Supplier/Part Relationships: Suppliers ↔ Parts through Supply records. Multiple suppliers per part.
•Employee/Project Assignments: Employees ↔ Projects through Assignments. Shared resources.
•Authors/Publications: Authors ↔ Papers through Authorship. Multi-author works.
•Social Networks: People ↔ People through Connections. Symmetric many-to-many.

Model Selection Guidelines
Criterion	Choose Hierarchical If...	Choose Network If...
Relationships	Purely 1:N, strict containment	M:N or bidirectional access needed
Access patterns	Always root-to-leaf	Multiple entry points, varied directions
Data redundancy	Acceptable (or natural)	Must be minimized
Query flexibility	Predictable, tree-matching queries	Ad-hoc traversal patterns
Development team	Familiar with IMS patterns	Need CODASYL relationship flexibility
Legacy integration	Existing IMS infrastructure	CODASYL-based systems in place

Real-World Complexity

In practice, most real-world domains combine hierarchical and network patterns. An organization might have a true hierarchy (org chart) plus many-to-many relationships (employees on projects). This forced organizations to either accept limitations or maintain hybrid approaches—contributing to the appeal of the more flexible relational model.

Historical Evolution and Market Dynamics

The competition between hierarchical and network models shaped the database industry for two decades before both were largely superseded by the relational model.

Timeline of Dominance:

Database Model Timeline
Era	Dominant Models	Key Developments
1960-1965	File systems, early experiments	IDS at GE, GUAM at IBM (IMS precursor)
1965-1970	Hierarchical emergence	IMS released (1968), CODASYL DBTG formed (1967)
1970-1975	Peak navigational era	CODASYL report (1971), Codd's relational papers (1970)
1975-1980	Coexistence	IDMS, IDS/II flourish; System R, INGRES prove relational
1980-1985	Relational rise	Oracle, DB2 gain enterprise adoption; SQL standardized
1985-1995	Relational dominance	RDBMS become default; hierarchical/network in maintenance
1995-present	Post-relational diversification	Object, NoSQL, graph databases; IMS and IDMS surviving in legacy

Why the Relational Model Won:

Both hierarchical and network models lost market dominance to the relational model for several key reasons:

Data Independence: Relational databases separate logical structure from physical storage and from application logic. Programs don't encode navigation paths.
Declarative Queries: SQL specifies what data is needed, not how to retrieve it. The optimizer handles access paths.
Ad-Hoc Query Capability: Business users can write SQL queries without programmer intervention. Both IMS DL/I and CODASYL DML require programming.
Schema Flexibility: Adding an index or changing storage structure doesn't require application changes. Navigational models embedded structure in programs.
Vendor Competition: Multiple relational vendors (Oracle, IBM DB2, SQL Server, Informix) drove innovation and lowered costs. IMS was IBM-only; CODASYL implementations fragmented.

Legacy Survival

Despite losing market share decades ago, IMS and IDMS systems still run in production at major corporations and government agencies. Migration costs and risks often outweigh benefits for stable, high-volume transaction systems. Some banks process millions of transactions daily on IMS databases designed in the 1970s.

Summary: Two Paradigms, One Transition

The hierarchical and network models represent two valid approaches to structuring databases, each with distinct strengths and limitations shaped by their fundamental architectural decisions.

Key Takeaways

•Structural difference is fundamental: Hierarchical (tree) enforces single-parent; network (graph) allows multiple parents via different set types.
•M:N relationship handling distinguishes them: Hierarchical requires workarounds with duplication or logical relationships; network models naturally via intersection records.
•Data redundancy impacts differ: Hierarchical's single-parent constraint often forces duplication; network's shared records minimize redundancy.
•Performance profiles vary by workload: Hierarchical excels at tree-shaped, top-down access; network excels at multi-path, bidirectional traversal.
•Both require navigational programming: Neither offers declarative query capability, contributing to their replacement by relational databases.
•The relational model superseded both: Data independence, SQL declarative queries, and schema flexibility proved more important than navigational efficiency.
•Legacy systems persist: Mission-critical IMS and IDMS systems continue operating decades after installation, testament to their reliability.

Module Conclusion:

With this comparison, we complete our exploration of the network data model. We've traveled from fundamental graph structure concepts through CODASYL standardization, set relationship semantics, navigational programming patterns, and finally this comparison with the hierarchical approach.

The network model represented a significant advancement in data modeling capability—freeing designers from the rigid tree structure to represent complex real-world relationships naturally. Its influence persists in modern graph databases that once again embrace explicit relationship modeling and navigational queries, albeit with more user-friendly interfaces.

Understanding these historical models provides perspective on database design trade-offs that remain relevant: simplicity versus expressiveness, performance versus flexibility, and the eternal question of how much control to give programmers versus how much to abstract into the system.

Module Complete

You have completed Module 3: Network Model. You now understand graph structure fundamentals, CODASYL standardization, set relationship semantics, navigational programming, and how the network model compares to its hierarchical predecessor. This knowledge provides essential context for understanding database evolution and the design principles that shape modern systems.

5 / 5

Loading learning content...

Database Management SystemsNetwork Model

The Network Data Model

LevelIntermediate

Duration60 mins

TopicNetwork Model

5 / 5

Comparison with Hierarchical Model — A Tale of Two Paradigms

Two Paths from the Same Origin

Learning Objectives

Structural Foundations: Trees vs. Graphs

The most fundamental difference lies in the underlying graph structures:

Hierarchical Model (Tree Structure):

Single root: Each database area has exactly one root record type
Single parent constraint: Every non-root record has exactly one parent
No cycles: The structure forms a Directed Acyclic Graph (specifically, a tree)
Implicit relationships: Parent-child links are the only relationships
Top-down access: Navigation primarily flows from root toward leaves

Network Model (Graph Structure):

Multiple entry points: Any record type can serve as a starting point
Multiple parents allowed: A record can be a member of multiple set types
Cycles permitted: The graph can contain circular paths
Named relationships: Set types explicitly define and name relationships
Multi-directional access: Navigate up, down, or across the graph

Converting Mermaid diagram...

Structural Property Comparison
Property	Hierarchical (Tree)	Network (Graph)
Mathematical structure	Rooted tree (arborescence)	Directed graph (may have cycles)
Parent count per node	Exactly 1 (except root: 0)	0, 1, or many (via different sets)
Paths between nodes	Exactly 1	Potentially many
Root requirement	Mandatory—one per hierarchy	None—any record can be an entry point
Relationship cardinality	1:N only (implicit)	1:N per set, M:N via intersection records
Navigation direction	Primarily parent→child	Any direction: owner→member, member→owner, across sets

The Single-Parent Constraint

Relationship Modeling Capabilities

The most significant practical difference between the models is how they handle various relationship types.

One-to-One (1:1) Relationships:

Both models handle 1:1 relationships adequately:

Hierarchical: Child segment under parent, with single occurrence
Network: Set with at most one member per owner occurrence

One-to-Many (1:N) Relationships:

Both models excel at 1:N—this is their natural structure:

Hierarchical: Parent with multiple children (department → employees)
Network: Owner with multiple members (department owns employee set)

Many-to-Many (M:N) Relationships:

This is where the models diverge dramatically.

Hierarchical: M:N Workarounds

The hierarchical model cannot directly represent M:N. Workarounds include:

Virtual Pairing: Create intersection segments under each parent (data duplication)

Logical Pointers: Store key values that reference records elsewhere (loses navigational performance)

Hierarchy Duplication: Repeat entire hierarchies from different perspectives

All workarounds have significant drawbacks.

Network: M:N Native Support

The network model handles M:N naturally:

Intersection Records: Create a record type that is a member of sets from both parents

Dual Ownership: The intersection record has two owners via different sets

No Duplication: Each entity appears once; relationships connect them

M:N is expressed without data redundancy.

many_to_many_comparison.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
/* ================================================================
   EXAMPLE: Students enrolled in Courses (M:N Relationship)
   - Each student can enroll in many courses
   - Each course can have many enrolled students
   ================================================================ */
 
/* HIERARCHICAL MODEL APPROACH */
 
Database Schema (IMS-style):
    COURSE (root)
        └── ENROLLMENT (child with pointer to STUDENT)
            Contains: STUDENT-KEY (logical child pointer)
            
    STUDENT (separate root in another hierarchy)
        └── S-ENROLLMENT (virtual segment)
            Physical twin of ENROLLMENT under COURSE
            
/* Problems:
   - ENROLLMENT data may be duplicated
   - Must maintain both paths in sync
   - Queries from student perspective require different access path
   - Complex pointer management */
 
 
/* NETWORK MODEL APPROACH */
 
    COURSE ──[COURSE_ENROLLMENT SET]──> ENROLLMENT
    STUDENT ──[STUDENT_ENROLLMENT SET]──> ENROLLMENT
    
    ENROLLMENT is a member of TWO sets:
    - COURSE_ENROLLMENT: owned by COURSE
    - STUDENT_ENROLLMENT: owned by STUDENT
 
/* Benefits:
   - Single ENROLLMENT record per student-course pair
   - Navigate from COURSE through ENROLLMENT to retrieve students
   - Navigate from STUDENT through ENROLLMENT to retrieve courses
   - No data duplication
   - Pointer chains maintained by DBMS */
 
/* Navigation from COURSE to get all enrolled STUDENTS: */
 
FIND ANY COURSE USING CourseCode = "CS101".
FIND FIRST ENROLLMENT WITHIN COURSE_ENROLLMENT.
PERFORM UNTIL end-of-set
    -- Navigate via other set to get student
    FIND OWNER WITHIN STUDENT_ENROLLMENT.
    GET STUDENT.
    DISPLAY StudentName.
    -- Return and continue
    FIND NEXT ENROLLMENT WITHIN COURSE_ENROLLMENT.
END-PERFORM.

IMS Logical Relationships

Data Redundancy and Consistency

The structural constraints of each model have profound implications for data redundancy and consistency maintenance.

Hierarchical Model Redundancy:

Because each record can have only one parent, data that belongs to multiple contexts must often be duplicated:

hierarchical_redundancy.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
/* Example: Supplier information in a manufacturing database */
 
/* Hierarchical approach - redundancy required: */
 
HIERARCHY 1: PRODUCT-focused
    PRODUCT
        └── PRODUCT_COMPONENT
            └── COMPONENT_SUPPLIER  // Contains supplier details
                - SupplierName
                - SupplierAddress
                - SupplierRating
 
HIERARCHY 2: SUPPLIER-focused  
    SUPPLIER  // Full supplier record
        └── SUPPLIER_PRODUCT  // Products they supply
 
/* Problem: If Supplier "Acme Corp" supplies 50 products,
   their name/address/rating appears in:
   - Once in SUPPLIER hierarchy
   - 50 times in COMPONENT_SUPPLIER segments under each product
   
   If Acme Corp moves to a new address:
   - Must update SUPPLIER record: 1 update
   - Must update 50 COMPONENT_SUPPLIER records: 50 updates
   - Risk: Updates may be missed → data inconsistency
*/

Network Model: Reduced Redundancy

The network model's ability to share records across multiple owners eliminates most structural redundancy:

network_no_redundancy.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
/* Network approach - no redundancy: */
 
SUPPLIER record (stored ONCE):
    - SupplierID
    - SupplierName
    - SupplierAddress
    - SupplierRating
 
PRODUCT record (stored ONCE):
    - ProductID
    - ProductName
 
SUPPLY record (intersection, one per relationship):
    - Member of SUPPLIER_SUPPLY set (owner: SUPPLIER)
    - Member of PRODUCT_SUPPLY set (owner: PRODUCT)
    - Contains: Price, LeadTime, Quantity available
 
/* If Supplier "Acme Corp" supplies 50 products:
   - 1 SUPPLIER record for Acme Corp
   - 50 SUPPLY records linking Acme to products
   - Supplier details appear ONCE
   
   If Acme Corp moves to a new address:
   - Update 1 SUPPLIER record
   - All 50 product relationships automatically reflect correct info
   - No risk of inconsistency
*/

Redundancy Implications
Factor	Hierarchical Impact	Network Impact
Storage efficiency	Lower—duplicated data consumes space	Higher—shared records, less duplication
Update operations	Multiple updates for one logical change	Single update propagates via pointers
Consistency risk	High—missed updates cause inconsistency	Low—one source of truth
Delete complexity	Must delete all copies	Delete once, set membership removed
Application burden	Must coordinate multi-point updates	DBMS maintains relationship integrity

Performance Characteristics and Trade-offs

Both models deliver excellent performance for their intended workloads, but their performance profiles differ significantly.

Hierarchical Model Performance Advantages:

Simpler Navigation: Only one path exists from root to any record. No ambiguity, no choice overhead.
Physical Clustering: Related segments can be stored contiguously on disk (adjacent in a HIDAM or HDAM structure), improving I/O for tree traversals.
Predictable Access Patterns: Top-down access aligns with physical storage, yielding consistent performance.
Less Pointer Overhead: Each record has only one parent pointer (or implicit adjacency), reducing storage overhead.

hierarchical_performance.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
/* Hierarchical performance scenario: Bill of Materials */
 
/* Navigate from Product to all components (tree traversal) */
 
Product: "Bicycle"
    ├── Component: "Frame"
    │       ├── Subcomponent: "Steel Tube"
    │       └── Subcomponent: "Welded Joint"
    ├── Component: "Wheels"
    │       ├── Subcomponent: "Tire"
    │       └── Subcomponent: "Rim"
    └── Component: "Handlebars"
 
/* Access Pattern:
   GU (Get Unique) PRODUCT WHERE ProductID = 'Bicycle'
   GNP (Get Next within Parent) to iterate children
   
   Physical Storage: All segments stored contiguously
   I/O Pattern: Sequential read through related data
   Performance: Excellent - minimal disk seeks
 
   This is the IDEAL use case for hierarchical:
   - True tree structure
   - Access always from root downward
   - Components genuinely "belong" to products
*/

Network Model Performance Advantages:

Multiple Access Paths: Can reach data from any direction; application chooses optimal path for each query.
M:N Traversal Efficiency: Following pointers through intersection records is faster than IMS logical relationships.
Bidirectional Navigation: Can traverse from member to owner without searching; owner pointer is explicit.
Set Indexing: Some implementations allow indexed access within sets, further optimizing member lookup.

network_performance.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
/* Network performance scenario: Multi-path queries */
 
/* Query: Find all projects for employee E1001 */
 
-- Path 1: Start from Employee
FIND ANY EMPLOYEE USING EmployeeID = "E1001".  -- O(1) via CALC
FIND FIRST ASSIGNMENT WITHIN EMP_ASSIGNMENTS.
PERFORM navigate through assignments to get projects...
 
/* Query: Find all employees on project P100 */
 
-- Path 2: Start from Project  
FIND ANY PROJECT USING ProjectCode = "P100".  -- O(1) via CALC
FIND FIRST ASSIGNMENT WITHIN PROJECT_ASSIGNMENTS.
PERFORM navigate through assignments to get employees...
 
/* Both queries are efficient!
   The same ASSIGNMENT records are accessed via different sets.
   No duplication, no complex logical relationships.
   
   In hierarchical model, one direction would be fast,
   the other would require searching or secondary indexes.
*/

Performance Profile Comparison
Access Pattern	Hierarchical	Network
Root-to-leaf traversal	Excellent (optimal case)	Good
Leaf-to-root navigation	Poor (requires search or secondary access)	Excellent (follow owner pointer)
M:N traversal	Poor (complex logical relationships)	Excellent (direct pointer chains)
Multi-path queries	Difficult (may need separate hierarchies)	Natural (navigate via different sets)
Sequential scan of type	Good (segment type search)	Good (system-owned sets)
Storage overhead	Lower (simpler pointers)	Higher (multiple set pointers per record)

Programming Complexity and Developer Experience

Both models require explicit navigational programming, but the complexity differs based on structural characteristics.

Hierarchical (IMS) Programming

•Single path simplicity: Only one way to reach any segment
•Segment Search Arguments (SSA): Specify search criteria per segment level
•GU/GN/GNP calls: Intuitive Get Unique, Get Next hierarchy
•Implied currency: Parent-child relationships are implicit in program flow
•Limited flexibility: Must follow hierarchy even when not natural for query

Network (CODASYL) Programming

•Multiple path flexibility: Many ways to reach any record
•Set-based navigation: FIND FIRST/NEXT/OWNER within specific sets
•Explicit currency: Must understand and manage currency indicators
•More DML verbs: FIND, GET, CONNECT, DISCONNECT, RECONNECT, etc.
•Greater power/complexity: More operations = more to learn and get wrong

programming_comparison.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
/* ================================================================
   SAME QUERY: List all employees in department "Engineering"
   ================================================================ */
 
/* IMS HIERARCHICAL (DL/I calls): */
 
MOVE 'ENGINEERING' TO DEPT-NAME-SSA.
CALL 'CBLTDLI' USING GU, PCB, DEPT-IO-AREA, DEPT-SSA.
IF STATUS-CODE = SPACES
    CALL 'CBLTDLI' USING GNP, PCB, EMP-IO-AREA.
    PERFORM UNTIL STATUS-CODE = 'GE'  -- No more children
        DISPLAY EMP-NAME
        CALL 'CBLTDLI' USING GNP, PCB, EMP-IO-AREA
    END-PERFORM.
END-IF.
 
/* Explanation:
   - GU: Get Unique root segment matching SSA
   - GNP: Get Next segment within Parent
   - 'GE' status: End of parent's children
   - Navigation is strictly parent→child
*/
 
 
/* CODASYL NETWORK: */
 
FIND ANY DEPARTMENT USING DeptName = "ENGINEERING".
IF DB-STATUS = "0000000"
    FIND FIRST EMPLOYEE WITHIN DEPT_EMPLOYEE.
    PERFORM UNTIL DB-STATUS = "0502100"  -- End of set
        GET EMPLOYEE.
        DISPLAY EMP-NAME IN EMPLOYEE.
        FIND NEXT EMPLOYEE WITHIN DEPT_EMPLOYEE.
    END-PERFORM.
END-IF.
 
/* Explanation:
   - FIND ANY: Direct access via CALC key
   - FIND FIRST/NEXT: Set traversal
   - GET: Retrieve data (separate from FIND)
   - Can navigate any direction from here
*/
 
 
/* Complexity comparison for THIS query:
   - Both are fairly straightforward
   - IMS is slightly more compact (combined search arguments)
   - CODASYL has clearer separation of FIND vs GET
   - Similar cognitive load for simple hierarchy traversal
*/

Where Complexity Diverges:

For simple hierarchical queries, both models are manageable. The differences emerge with:

M:N Relationships: CODASYL handles naturally; IMS requires complex logical relationship programming
Multi-directional Queries: CODASYL navigates freely; IMS requires separate access paths or secondary processing
Currency Management: CODASYL's multiple currency types require more careful state management than IMS's simpler parent-child position
Schema Evolution: Changing structure in IMS often requires application rewrites; CODASYL set changes can sometimes be isolated

Neither Was Simple

Use Case Alignment: Where Each Model Excels

Each model aligns better with certain data patterns and application requirements.

Hierarchical Model Ideal Use Cases:

Hierarchical Strengths

•Bill of Materials (BOM): Products → Assemblies → Components → Sub-components. True tree structure; navigation always from product down.
•Organizational Charts: Company → Divisions → Departments → Teams → Employees. Clear reporting hierarchy.
•Document Management: Library → Category → Document → Version. Containment relationships.
•File Systems: Directory → Subdirectory → File. Classic tree structure.
•Parsing/Syntax Trees: Expression trees, XML/JSON document structures. Naturally hierarchical.

Network Model Ideal Use Cases:

Network Strengths

•Student/Course Enrollment: Students ↔ Courses through Enrollments. Classic M:N relationship.
•Supplier/Part Relationships: Suppliers ↔ Parts through Supply records. Multiple suppliers per part.
•Employee/Project Assignments: Employees ↔ Projects through Assignments. Shared resources.
•Authors/Publications: Authors ↔ Papers through Authorship. Multi-author works.
•Social Networks: People ↔ People through Connections. Symmetric many-to-many.

Model Selection Guidelines
Criterion	Choose Hierarchical If...	Choose Network If...
Relationships	Purely 1:N, strict containment	M:N or bidirectional access needed
Access patterns	Always root-to-leaf	Multiple entry points, varied directions
Data redundancy	Acceptable (or natural)	Must be minimized
Query flexibility	Predictable, tree-matching queries	Ad-hoc traversal patterns
Development team	Familiar with IMS patterns	Need CODASYL relationship flexibility
Legacy integration	Existing IMS infrastructure	CODASYL-based systems in place

Real-World Complexity

Historical Evolution and Market Dynamics

The competition between hierarchical and network models shaped the database industry for two decades before both were largely superseded by the relational model.

Timeline of Dominance:

Database Model Timeline
Era	Dominant Models	Key Developments
1960-1965	File systems, early experiments	IDS at GE, GUAM at IBM (IMS precursor)
1965-1970	Hierarchical emergence	IMS released (1968), CODASYL DBTG formed (1967)
1970-1975	Peak navigational era	CODASYL report (1971), Codd's relational papers (1970)
1975-1980	Coexistence	IDMS, IDS/II flourish; System R, INGRES prove relational
1980-1985	Relational rise	Oracle, DB2 gain enterprise adoption; SQL standardized
1985-1995	Relational dominance	RDBMS become default; hierarchical/network in maintenance
1995-present	Post-relational diversification	Object, NoSQL, graph databases; IMS and IDMS surviving in legacy

Why the Relational Model Won:

Both hierarchical and network models lost market dominance to the relational model for several key reasons:

Data Independence: Relational databases separate logical structure from physical storage and from application logic. Programs don't encode navigation paths.
Declarative Queries: SQL specifies what data is needed, not how to retrieve it. The optimizer handles access paths.
Ad-Hoc Query Capability: Business users can write SQL queries without programmer intervention. Both IMS DL/I and CODASYL DML require programming.
Schema Flexibility: Adding an index or changing storage structure doesn't require application changes. Navigational models embedded structure in programs.
Vendor Competition: Multiple relational vendors (Oracle, IBM DB2, SQL Server, Informix) drove innovation and lowered costs. IMS was IBM-only; CODASYL implementations fragmented.

Legacy Survival

Summary: Two Paradigms, One Transition

The hierarchical and network models represent two valid approaches to structuring databases, each with distinct strengths and limitations shaped by their fundamental architectural decisions.

Key Takeaways

•Structural difference is fundamental: Hierarchical (tree) enforces single-parent; network (graph) allows multiple parents via different set types.
•M:N relationship handling distinguishes them: Hierarchical requires workarounds with duplication or logical relationships; network models naturally via intersection records.
•Data redundancy impacts differ: Hierarchical's single-parent constraint often forces duplication; network's shared records minimize redundancy.
•Performance profiles vary by workload: Hierarchical excels at tree-shaped, top-down access; network excels at multi-path, bidirectional traversal.
•Both require navigational programming: Neither offers declarative query capability, contributing to their replacement by relational databases.
•The relational model superseded both: Data independence, SQL declarative queries, and schema flexibility proved more important than navigational efficiency.
•Legacy systems persist: Mission-critical IMS and IDMS systems continue operating decades after installation, testament to their reliability.

Module Conclusion:

Module Complete

5 / 5