Skip to content

Lineage Management

Overview

Lineage Management in Osprey provides a comprehensive framework for defining and tracking data relationships between assets in your industrial systems. This feature enables you to document how data flows through your infrastructure, creating a complete picture of your data ecosystem for better decision-making, troubleshooting, and compliance.

Understanding Data Lineage

What is Data Lineage?

Data lineage is the documentation of data flow through your systems - from its origin to its final destination, including all transformations, calculations, and transfers along the way. In industrial environments, this typically involves:

  • Source Systems: Where data originates (sensors, PLCs, field devices)
  • Transfer Mechanisms: How data moves between systems (PI Interfaces, network connections)
  • Processing Points: Where data is calculated, aggregated, or transformed
  • Storage Systems: Where data is archived or made available for analysis
  • Consumption Points: Where data is used for reporting, visualization, or control

Why Lineage Management Matters

Operational Benefits: - Root Cause Analysis: Quickly trace data quality issues back to their source - Impact Assessment: Understand which downstream systems are affected by upstream changes - System Dependencies: Visualize critical data paths and potential failure points - Performance Optimization: Identify bottlenecks and redundancies in data flow

Compliance and Governance: - Audit Trails: Document data provenance for regulatory requirements - Data Quality: Track data transformations and validation points - Change Management: Assess the impact of system modifications - Documentation: Maintain comprehensive records of data architecture

Strategic Planning: - Migration Support: Plan system upgrades and data migrations effectively - Integration Planning: Understand existing connections when adding new systems - Capacity Planning: Identify high-traffic data paths and plan for growth - Risk Management: Assess critical data paths and implement appropriate redundancy

Lineage Management Approaches

Osprey provides two complementary approaches to lineage management:

1. Manual Lineage

Define explicit, individual relationships between specific assets when automated discovery isn't sufficient or when you need to document business logic relationships.

Best For: - Custom business logic and calculated relationships - Cross-system connections and integrations - Exceptional cases and complex configurations - Legacy systems without automated discovery capabilities

Key Features: - Precise control over relationship definition - Support for any asset type or system - Immediate relationship creation - Custom business logic documentation

2. PI Interface Lineage

Configure templates for PI-to-PI interface connections that automatically generate lineage relationships for matching tags across different PI Data Archives.

Best For: - Standard PI-to-PI interface configurations - Large-scale automated lineage creation - Consistent interface patterns across multiple systems - Dynamic environments with frequent changes

Key Features: - Automated relationship creation - Template-based configuration - Scale handling for thousands of tags - Automatic maintenance and updates

Choosing the Right Approach

Use Manual Lineage When:

  • Custom Logic: You need to document calculated or derived relationships
  • Cross-Domain: Relationships span different systems or security boundaries
  • Exceptions: Standard patterns don't apply to your specific case
  • Legacy: Working with older systems without modern interface capabilities
  • Business Context: Relationships represent business logic rather than technical connections

Use PI Interface Lineage When:

  • Standard Interfaces: You have PI-to-PI interfaces with consistent patterns
  • Scale: Managing hundreds or thousands of interface tags
  • Automation: You want relationships to update automatically
  • Consistency: You need uniform lineage across similar interfaces
  • Maintenance: You want to minimize manual lineage management overhead

Combined Approach:

Many organizations benefit from using both approaches: - PI Interface Lineage for standard, repeatable interface patterns - Manual Lineage for custom logic, exceptions, and cross-system relationships

Lineage Visualization and Usage

Accessing Lineage Information

Once relationships are defined, you can view and use lineage information throughout Osprey:

Asset Detail Pages: - View upstream and downstream connections for individual assets - Trace data flow paths from any asset - Understand data dependencies and impact relationships

Lineage Diagrams: - Visual representation of data flow through your systems - Interactive exploration of complex data relationships - Impact analysis for planned changes or troubleshooting

Reports and Analytics: - Generate reports on data flow patterns - Analyze system dependencies and critical paths - Support compliance and audit requirements

Integration with Other Osprey Features

Enhanced Asset Management

Explorer Page: - Lineage relationships enhance asset browsing and discovery - Related assets are easier to find and understand - Data flow context improves asset management decisions

Improved Issue Resolution

Issues Page: - Lineage information helps trace data quality problems to their source - Impact analysis shows which assets are affected by upstream issues - Root cause analysis is more effective with complete lineage information

Advanced Monitoring

Views and Alerts: - Create views that include lineage-related assets - Set up alerts that consider data flow dependencies - Monitor critical data paths for performance or quality issues

Comprehensive Reporting

Analytics and Business Intelligence: - Include lineage information in compliance reports - Analyze data flow patterns for optimization opportunities - Support business intelligence initiatives with comprehensive data provenance

Best Practices for Lineage Management

Planning and Strategy

Assessment First: - Understand your data flow patterns before implementing lineage - Identify critical data paths that require documentation - Determine which approach (manual, PI interface, or both) best fits your needs

Start Small: - Begin with critical systems or well-understood data flows - Expand lineage coverage gradually as you gain experience - Focus on high-value relationships first

Implementation Guidelines

Documentation Standards: - Establish consistent naming conventions and documentation practices - Maintain clear records of why relationships were created - Keep track of relationship ownership and maintenance responsibilities

Quality Assurance: - Regularly validate lineage accuracy against actual system behavior - Implement review processes for critical lineage relationships - Monitor for changes that might affect lineage accuracy

Maintenance and Governance

Change Management: - Update lineage documentation when systems change - Coordinate lineage management with system upgrade and migration projects - Maintain historical records of lineage changes

Training and Knowledge Transfer: - Train team members on lineage management principles and tools - Document procedures for lineage creation and maintenance - Establish expertise and backup coverage for lineage management

Getting Started

Quick Start Guide

  1. Assess Your Environment: Identify the types of data relationships you need to document
  2. Choose Your Approach: Determine whether to start with manual lineage, PI interface lineage, or both
  3. Plan Your Implementation: Start with critical systems and expand gradually
  4. Create Initial Relationships: Follow the detailed guides for your chosen approach
  5. Validate and Refine: Test lineage accuracy and adjust as needed
  6. Establish Maintenance: Set up processes for ongoing lineage management

Next Steps

Support and Resources

For additional help with lineage management: - Review the detailed guides for each approach - Contact your domain administrator for permissions and access - Submit support tickets for technical issues - Consult with your data management team for organizational policies

Effective lineage management is a critical foundation for data quality, compliance, and operational excellence in industrial environments. By understanding and implementing the appropriate lineage management approach for your needs, you can create a comprehensive view of your data ecosystem that supports better decision-making and more effective troubleshooting.