Return to Home
Deep Research

The Ontology of Value

A comprehensive interactive guide to financial data classification, architecture, and lifecycle management. Understanding the DNA of modern capital markets.

In the modern financial ecosystem, data is not merely information—it is the structural DNA that enables every transaction, valuation, and risk calculation. From the moment a security is issued to its final settlement, a complex web of identifiers, classifications, and temporal records governs its existence.

This comprehensive guide explores the ontology of financial value: the systematic classification and lifecycle management of financial instruments, entities, transactions, and positions. Whether you're building a trading platform, implementing a portfolio management system, or simply seeking to understand how institutional finance operates at the data layer, this framework provides the foundational architecture.

What You'll Master

  • Product Master: Global identification standards (ISIN, CUSIP, FIGI) and asset-specific attributes
  • Entity & Account Hierarchies: From households to sleeves, understanding ownership structures
  • Transaction Lifecycle: ISO 20022 migration and structured event processing
  • Tax Lot Accounting: Cost basis methodologies and corporate action mathematics
  • The Three Books: IBOR, ABOR, and PBOR—separating trading, accounting, and performance views
  • Performance Attribution: TWRR vs MWRR and GIPS compliance frameworks
  • Risk Architecture: VaR, stress testing, and liquidity classification
  • Data Engineering: Master Data Management (MDM) and bitemporal design patterns

The Product Master

The central nervous system of any investment platform. It defines the universe of tradable assets and their behavioral attributes.

The Product Master is the authoritative registry of all tradable instruments within a financial system. It serves as the single source of truth for security attributes, from basic identifiers to complex behavioral characteristics. Without a robust Product Master, downstream processes—trading, risk management, compliance reporting—become impossible.

Think of it as the "birth certificate" for every financial instrument. Before a security can be traded, valued, or reported, it must first be defined. This definition includes not just "what it is" (equity, bond, derivative), but also "how it behaves" (coupon frequency, voting rights, embedded options).

Global Identification Standards

Before an instrument can be classified, it must be uniquely identified across global markets. Multiple identification schemes exist, each serving different regulatory and operational purposes.

ISIN

International Securities Identification Number. ISO 6166 standard. 12-character alphanumeric code (2-letter country + 9-digit identifier + check digit). Essential for cross-border trading and MiFID II reporting. Example: US0378331005 (Apple Inc.).

CUSIP

Committee on Uniform Securities Identification Procedures. 9-character alphanumeric code used in North America. Crucial for settlement within the DTC (Depository Trust Company) ecosystem. Managed by CUSIP Global Services (CGS).

FIGI

Financial Instrument Global Identifier. An open standard maintained by Bloomberg that remains persistent across corporate actions, unlike tickers which can change. Enables cross-platform instrument matching without vendor lock-in.

LEI

Legal Entity Identifier of the issuer. 20-character alphanumeric code. Mandatory under Dodd-Frank and EMIR for tracking counterparty risk and regulatory reporting. Links instruments to their ultimate parent entities.

Why Multiple Identifiers?

Different markets and regulators mandate different standards. A robust system maintains cross-reference mappings between ISIN, CUSIP, SEDOL, FIGI, and Bloomberg Ticker to ensure seamless data integration across vendors and jurisdictions.

Asset Class Attributes

Beyond identification, each asset class requires specific behavioral metadata to enable accurate valuation, risk analysis, and compliance reporting. The Product Master must capture these nuances.

Equities

  • Voting Rights: Essential for proxy voting workflows. Distinguishes common (voting) from preferred (non-voting) shares. Critical for ESG engagement strategies.
  • Free Float: Percentage of shares available for public trading (excludes insider holdings). Inputs for liquidity risk models and index weighting methodologies (e.g., MSCI uses free-float adjusted market cap).
  • GICS/ICB: Global Industry Classification Standard (MSCI/S&P) or Industry Classification Benchmark (FTSE). Sector classification driving rotation strategies, factor attribution, and benchmark construction.
  • ADR/GDR Ratio: For depositary receipts, the conversion ratio to underlying shares. Example: Alibaba ADR (BABA) represents 8 ordinary shares.
  • Dividend Policy: Frequency (quarterly, annual), ex-dividend dates, and payment dates. Required for income forecasting and total return calculations.

Fixed Income

Bonds are contractual cash flow machines. The Product Master must encode the legal terms that govern these flows.

AttributeDescriptionUsage
Coupon LogicFixed, Floating (LIBOR+spread), Zero-CouponCalculating Accrued Interest (AI)
Day Count Convention30/360, Act/Act, Act/360, Act/365Critical for settlement amount precision
Embedded OptionsCallable, Putable, ConvertibleCalc Yield-to-Worst, OAS, Convexity
SenioritySenior Secured, Senior Unsecured, SubordinatedRecovery rate assumptions in default scenarios
Credit RatingS&P, Moody's, Fitch compositeRegulatory capital requirements (Basel III)

Derivatives

Options, futures, and swaps require contract specifications that define payoff structures.

  • Underlying Asset: Link to Product Master entry (e.g., SPY for SPY options)
  • Contract Multiplier: 100 shares per equity option, $1000 per S&P 500 futures point
  • Exercise Style: American (anytime), European (expiry only), Bermudan (specific dates)
  • Settlement Method: Physical delivery vs. cash settlement
  • Margin Requirements: Initial and maintenance margin for futures/options

Regulatory Classifications: SFDR

The EU's Sustainable Finance Disclosure Regulation (SFDR) mandates product-level sustainability classification. This metadata is now a required field for any fund sold in Europe.

Article 6Integrates sustainability risks only. No explicit ESG promotion.
Article 8 (Light Green)Promotes environmental/social characteristics. Must disclose how characteristics are met.
Article 9 (Dark Green)Sustainable investment is the objective. Strictest reporting requirements.

Visual Framework: The Complete Data Architecture

This comprehensive infographic maps the entire financial data ecosystem—from product identification to performance measurement. Click to explore in full-screen mode.

The Ontology of Value - Complete Financial Data Architecture
Click to view full screen

Entity & Account Master

Defining who is trading and where assets reside. The backbone of client reporting and compliance.

While the Product Master defines what is being traded, the Entity & Account Master defines who owns it and where it resides. This hierarchical structure enables everything from consolidated wealth reporting to regulatory compliance (KYC/AML) to tax optimization strategies.

Modern wealth management platforms support multi-level account hierarchies that mirror real-world ownership structures: from households (family wealth) down to sleeves (sub-account allocations within Unified Managed Accounts).

The Account Hierarchy

A four-tier structure that balances operational flexibility with regulatory requirements:

1. Client / Household

Top-level owner (e.g., "The Smith Family"). Aggregates Total Wealth across all accounts. Used for relationship management and consolidated reporting. Links to CRM systems.

2. Custodial Account

Legal vessel at a custodian bank (e.g., Pershing, Schwab). This is the level for Form 1099-B tax reporting. Each account has a unique account number and legal registration (Individual, Joint, IRA, Trust).

3. Portfolio / Strategy

Logical grouping within an account (e.g., "US Growth Equity", "Fixed Income Core"). Enables model portfolio assignment and performance attribution by strategy. Not a legal entity.

4. Sleeve / Sub-Account

Virtual partition for Unified Managed Accounts (UMAs). Enables multi-manager strategies within a single custodial account. Critical for avoiding wash sales across sleeves while maintaining tax-lot isolation.

Why This Matters

Aggregation flexibility: Report at household level for wealth planning, but trade and reconcile at custodial account level for operational accuracy. Performance attribution happens at portfolio level, while tax optimization occurs at sleeve level.

Legal Entity Controls & Compliance

Every entity in the system must be classified for regulatory compliance. These attributes drive automated screening, reporting, and risk management workflows.

KYC & AML

Know Your Customer and Anti-Money Laundering. Mandatory classification for High Risk entities, PEP (Politically Exposed Persons), or Sanctioned individuals/countries.

Risk Tiers: Low, Medium, High, Prohibited

Screening: OFAC, EU Sanctions, UN Sanctions lists

Refresh Frequency: Daily for High Risk, Quarterly for others

FATCA / CRS

Foreign Account Tax Compliance Act (US) and Common Reporting Standard (OECD). Tax residency classification to facilitate global tax information exchange.

W-8/W-9 Forms: Capture tax residency and withholding status

Reportable Accounts: Automatic exchange with tax authorities

Withholding Rates: 0%, 15%, 30% based on treaty status

Counterparty Risk

Linking issuers to ultimate parents (e.g., Apple Inc. → Apple Inc. Parent) to view total corporate family exposure and concentration risk.

LEI Hierarchy: Map subsidiaries to ultimate parent

Exposure Limits: Aggregate across all entities in corporate tree

Credit Events: Propagate ratings changes up/down hierarchy

Regulatory Reporting

Entity Master data feeds directly into Form PF (SEC), AIFMD (EU), and MiFID II transaction reporting. Incomplete or inaccurate entity data can result in regulatory fines.

Transaction Lifecycle

The atomic events of the financial system. Moving from unstructured text to structured ISO 20022 data.

Transactions are the atomic events that change portfolio state. Every buy, sell, dividend, corporate action, or fee must be captured, classified, and processed through a standardized lifecycle. The industry is undergoing a historic migration from legacy messaging formats to ISO 20022—a shift that enables richer data capture and automated reconciliation.

ISO 20022 Standard

The industry is migrating from ISO 15022 (MT) to ISO 20022 (MX). This shift moves from unstructured text blocks to rich, structured XML/JSON data models capable of carrying "Ultimate Debtor", detailed remittance info, and regulatory identifiers.

Why It Matters

  • Straight-Through Processing (STP): Automated reconciliation without manual intervention
  • Regulatory Compliance: Embedded LEI, transaction IDs for MiFID II/EMIR
  • Cross-Border Payments: SWIFT gpi tracking, FX transparency
  • Corporate Actions: Structured election instructions (e.g., stock vs. cash dividend)
LegacyMT Format
ModernMX Format

Core Transaction Types

Trade ActivityBuy, Sell, Short, Cover
T+2 Settlement
Income ActivityDividends, Interest, Coupons
Cash Flow
Corporate ActionsSplits, Mergers, Spin-offs
Mandatory/Voluntary
Cash ActivityDeposits, Withdrawals, Fees
Non-Trade

Transaction State Machine

Every transaction progresses through a state machine from initiation to final settlement:

1
Pending

Order submitted, awaiting execution

2
Executed

Trade confirmed, price locked

3
Settled

Cash and securities exchanged (T+2)

4
Reconciled

Matched with custodian statement

Critical Transaction Attributes

Trade Date vs. Settlement Date

Trade Date (T) = when price is locked. Settlement Date (T+2 for equities) = when ownership transfers. Critical for accrual accounting.

Gross vs. Net Amount

Gross = principal. Net = principal ± fees ± accrued interest. Settlement systems use Net Amount.

Transaction ID (UTI)

Unique Transaction Identifier. Required for EMIR/Dodd-Frank reporting. Must be globally unique and persistent.

Counterparty

Who is on the other side? Broker, exchange, or internal transfer. Links to Entity Master for credit risk.

Tax Lot Accounting

The micro-structure of holdings. Managing distinct purchase events, cost basis, and selection methodologies.

A tax lot is a distinct purchase event with its own acquisition date, quantity, and cost basis. When you sell shares, the IRS requires you to specify which lots you're selling—a decision that can dramatically impact your tax liability. This is where lot selection methodologies become critical.

Modern portfolio management systems maintain lot-level granularity to enable tax-loss harvesting, wash sale detection, and optimal liquidation strategies. The difference between FIFO and HIFO can mean thousands of dollars in tax savings.

Lot Selection Methodologies

Impacts Realized Gain
MethodUtilityTax Impact
FIFO (First-In, First-Out)Simplicity; Regulatory DefaultOften highest tax in rising markets.
LIFO (Last-In, First-Out)Volatile MarketsMinimizes gains in rising markets.
HIFO (Highest-In, First-Out)Tax Loss HarvestingMaximizes realized losses.

Corporate Action Math: Tax-Free Spin-Off

When a company spins off a subsidiary (e.g., PayPal from eBay), the IRS requires you to allocate your original cost basis between the parent and the new entity based on their relative fair market values on the distribution date.

// Allocate basis based on Relative Fair Market Value (FMV)

1. Total_Value = Parent_FMV + Sub_FMV

2. Ratio_Parent = Parent_FMV / Total_Value

3. Ratio_Sub = Sub_FMV / Total_Value

New_Parent_Basis = Old_Basis × Ratio_Parent

New_Sub_Basis = Old_Basis × Ratio_Sub

Example: You bought 100 shares of Parent Co. for $10,000. After spin-off, Parent trades at $80 and Sub at $20. Total value = $10,000. Ratio_Parent = 80%, Ratio_Sub = 20%. New basis: Parent = $8,000, Sub = $2,000.

Wash Sale Detection

The Wash Sale Rule (IRS Section 1091) disallows a loss deduction if you repurchase a "substantially identical" security within 30 days before or after the sale. The disallowed loss is added to the cost basis of the new lot.

Detection Algorithm

FOR each sale with loss:

  Check purchases in [Sale_Date - 30, Sale_Date + 30]

  IF same CUSIP OR substantially identical:

    Disallow loss

    Add disallowed loss to new lot basis

    Extend holding period

Substantially Identical: Options on same stock, convertible bonds, ETFs tracking same index. Not substantially identical: Different companies in same sector, different share classes (e.g., GOOG vs. GOOGL).

Cost Basis

Original purchase price adjusted for fees, commissions, and corporate actions. Crucial distinction:

Covered SecuritiesBroker must report cost basis to IRS on Form 1099-B.
(Equities acquired post-2011, Mutual Funds post-2012, Options post-2014).
Non-Covered SecuritiesInvestor is responsible for tracking and reporting cost basis. Common for pre-2011 holdings or inherited securities.

Holding Period

Determines tax treatment: Short-term (≤1 year) taxed as ordinary income. Long-term (>1 year) receives preferential capital gains rates (0%, 15%, or 20%).

Short-Term

Up to 37% tax rate

Long-Term

0-20% tax rate

Adjusted Cost Basis

Original basis must be adjusted for:

  • Stock Splits: Divide basis by split ratio
  • Return of Capital: Reduce basis (not taxable until basis = $0)
  • Wash Sales: Add disallowed loss to new lot
  • Reinvested Dividends: Each reinvestment creates new lot

Position Management

The Three Books of Record. Separating views based on time and purpose.

In institutional finance, there is no single "position" number. Instead, systems maintain three parallel books, each serving a distinct purpose and operating on different time horizons. Understanding this separation is critical for reconciliation, performance measurement, and regulatory compliance.

The IBOR/ABOR/PBOR framework emerged from the need to balance real-time trading decisions (IBOR) with accounting accuracy (ABOR) and performance analytics (PBOR). Each book has different data sources, update frequencies, and stakeholders.

IBOR

Investment Book of Record

Focus: Trading / Decision Support

Time: Real-Time (T+0, intraday)

Data Source: Order Management System (OMS), Execution Management System (EMS)

Includes pending orders and intraday executions. Critical for preventing overdrafts and compliance with position limits. Used by portfolio managers and traders for real-time risk monitoring.

Example Use Case: Pre-trade compliance checks (e.g., "Will this order violate sector concentration limits?")

ABOR

Accounting Book of Record

Focus: Reporting / NAV Calculation

Time: T+1 / End-of-Day (Historical)

Data Source: Custodian statements, reconciled transactions

Reconciled with custodian. Used for official client statements, regulatory filings (Form ADV, 13F), and fund accounting (NAV calculation). Must match custodian records to the penny.

Example Use Case: Daily NAV calculation for mutual funds, client quarterly statements

PBOR

Performance Book of Record

Focus: Analytics / Attribution

Time: Time-Weighted Series (snapshots at cash flow events)

Data Source: ABOR + market prices + cash flow timestamps

Requires snapshots at every cash flow event to calculate TWRR accurately. Used for performance attribution, manager evaluation, and GIPS compliance. Separates manager skill from client timing.

Example Use Case: Quarterly performance attribution reports showing sector allocation vs. security selection effects

Why Three Books?

IBOR

Optimized for speed. May include unconfirmed trades. Traders need to see positions update instantly.

ABOR

Optimized for accuracy. Only settled, reconciled transactions. Accountants need to match custodian statements.

PBOR

Optimized for analytics. Requires historical snapshots. Performance analysts need time-series data.

Performance Measurement

Distinguishing between manager skill and investor timing.

Performance measurement is not simply "ending value minus beginning value." The challenge: external cash flows (deposits/withdrawals) distort returns. A portfolio that receives a large deposit right before a market crash will show poor returns—but that's not the manager's fault.

The solution: TWRR vs. MWRR. TWRR (Time-Weighted Return) isolates manager skill by neutralizing cash flow timing. MWRR (Money-Weighted Return) captures the investor's actual experience, including their timing decisions.

Return Methodologies

TWRR (Time-Weighted Return)

Evaluates Manager Skill. Eliminates the impact of external cash flows by breaking the period into sub-periods and geometrically linking returns.

TWRR = [(1 + R₁) × (1 + R₂) × ... × (1 + Rₙ)] - 1

Where Rᵢ = return for sub-period i (between cash flows)

Use Case: Manager evaluation, GIPS compliance, comparing to benchmarks

MWRR (Money-Weighted Return)

Evaluates Investor Experience. Equivalent to Internal Rate of Return (IRR). Weights return by capital invested—sensitive to timing of deposits/withdrawals.

0 = -PV + CF₁/(1+r) + CF₂/(1+r)² + ... + FV/(1+r)ⁿ

Solve for r (the MWRR)

Use Case: Client reporting, evaluating investor's actual dollar-weighted returns

Key Difference

If an investor adds capital right before a market crash, MWRR will be lower than TWRR (bad timing). If they add capital right before a rally, MWRR will be higher (good timing). TWRR is unaffected by these flows.

GIPS Standards & Cash Flows

The Global Investment Performance Standards (GIPS) mandate specific treatment of cash flows to ensure comparability across managers.

  • External Cash FlowCapital transferring in/out of the portfolio. Must be client-directed (not manager-initiated).
  • Large Cash Flow (>10% of portfolio value)Firm MUST revalue portfolio on the day of the flow to ensure TWRR accuracy. This creates a new sub-period.
  • Significant Cash Flow (>30% of portfolio value)Firm may temporarily remove portfolio from composite to prevent distortion of composite returns.

Modified Dietz Method

An approximation of TWRR that doesn't require daily valuations. Weights cash flows by the proportion of the period they were invested:

R = (EMV - BMV - CF) / (BMV + Σ(CFᵢ × Wᵢ))

Where Wᵢ = (Days remaining after CFᵢ) / (Total days in period)

Risk Management

Forecasting potential losses using IBOR positions and market data.

Risk management transforms IBOR positions (what you own) and Product Master attributes (how assets behave) into forward-looking loss forecasts. The goal: answer "What could go wrong?" before it does.

Modern risk systems combine statistical models (VaR), scenario analysis (stress testing), and liquidity classification to provide a multi-dimensional view of portfolio vulnerability.

VaR (Value at Risk)

What is the maximum loss over time horizon T at confidence level X?

Inputs: Current Positions + Historical Prices or Volatility Matrices

Parametric VaR: Assumes normal distribution. Fast but fails in tail events.

Historical VaR: Uses actual historical returns. No distribution assumption.

Monte Carlo VaR: Simulates thousands of scenarios. Captures non-linear risks (options).

95% VaR = $1M means: "95% confident we won't lose more than $1M tomorrow"

Stress Testing

Subjecting portfolio to hypothetical extremes (e.g., '2008 Crisis', 'Fed +200bps').

Req: Full Revaluation using Product Master attributes (Duration, Convexity, Beta)

Historical Scenarios: Replay 2008, COVID-19, 1987 crash

Hypothetical Scenarios: "What if rates rise 3%?" "What if VIX spikes to 80?"

Reverse Stress: "What scenario would cause a 20% loss?"

Regulatory: CCAR (Fed), EBA (EU) mandate annual stress tests for banks

Liquidity Risk (LCR)

Classifying assets by ability to convert to cash without material price impact.

Metrics: Bid-Ask Spread, Average Daily Volume (ADV), Depth of Book

Tier 1 (High Liquidity): Large-cap equities, US Treasuries. Can liquidate in hours.

Tier 2 (Medium): Small-cap stocks, IG corporate bonds. Days to liquidate.

Tier 3 (Low): Private equity, real estate, distressed debt. Months to liquidate.

Liquidity Coverage Ratio (LCR): Basel III requires banks to hold enough liquid assets to survive 30-day stress

Integrated Risk Dashboard

Modern risk systems aggregate these metrics into a unified dashboard that updates in real-time as positions change:

Market Risk

VaR, Greeks (Delta, Gamma, Vega), Duration, Convexity

Credit Risk

Counterparty exposure, credit ratings, CDS spreads

Liquidity Risk

Days-to-liquidate, bid-ask costs, market depth

Concentration Risk

Single-name limits, sector exposure, geographic concentration

Data Architecture

System design principles: Master Data Management and Bitemporality.

All the concepts above—Product Master, Entity Master, Transactions, Tax Lots—must be engineered into a coherent data architecture. Two design patterns are critical: Master Data Management (MDM) for resolving conflicting data sources, and Bitemporal Design for preserving audit trails.

The Golden Copy (MDM)

Solves the "dueling data" problem. A centralized system ingests data from Bloomberg, Reuters, FactSet, etc., and applies Survivorship Rules to determine the single source of truth.

IF Asset_Class == 'Fixed Income'
  THEN Source Rating FROM S&P
ELSE IF Asset_Class == 'Equity'
  THEN Source Sector FROM MSCI
ELSE IF Asset_Class == 'Derivative'
  THEN Source Greeks FROM Bloomberg

Why MDM Matters

  • Data Conflicts: Bloomberg says Apple's sector is "Technology", MSCI says "Communication Services"
  • Vendor Outages: If primary source fails, automatically failover to secondary
  • Audit Trail: Track which vendor provided each field and when
  • Data Quality: Centralized validation rules (e.g., "Price cannot be negative")

Bitemporal Design

Tracking two timelines for every record to preserve complete audit trails and enable "as-of" queries.

Valid Time (Business Time)The real-world date the data applies to. Example: "Apple's price on Jan 2, 2025 was $180"
Transaction Time (System Time)The date the data was entered into the system. Example: "We discovered the error and corrected it on Jan 10, 2025"

Critical Use Cases

  • Regulatory Audits: "Show me what you knew on Dec 31, 2024" (query by Transaction Time)
  • Performance Recalculation: "Recalc returns using corrected prices" (query by Valid Time)
  • Error Correction: Fix historical data without losing record of the mistake
  • Compliance: Prove you acted on information available at the time

Data Lake vs. Data Warehouse

Data Lake (Raw Storage)

Store everything in native format (JSON, CSV, Parquet). Schema-on-read. Used for ML training, exploratory analysis.

Tech: S3, Azure Data Lake, Snowflake

Use: Historical tick data, unstructured research reports

Data Warehouse (Curated)

Store cleaned, transformed data in relational schema. Schema-on-write. Optimized for BI queries and reporting.

Tech: Redshift, BigQuery, Databricks

Use: Daily NAV, client statements, regulatory filings

Continue Learning