Folio MMXXVI
Pratham N. Shah  ·  Penn State University
PNS Research
Research · Data Science
Authored  ·  Three Active Projects
Penn State University
FOLIO · 00 — INTRODUCTION EST. MMXXIV
PNS · 00 Research Folio  ·  Penn State University

Structuring
what the models
leave out.[0]

Three research initiatives targeting the qualitative friction channels — geopolitical escalation, SEC footnote hedging schedules, and human capital workforce diligence. By mapping messy, unstructured records to bounded risk indices, these frameworks seek to build the numbers before the narrative.

[0] The number is the deliverable.
Everything else is context.
Research Portfolio  ·  Three Active Projects
PLATE · I
VALIDATED
Project Goldstein
Geopolitical Volatility Signal
12 Regions  ·  2022–2026

A scoring engine that takes real-time conflict data from GDELT and ACLED and turns it into a daily variance score for each region. The core bet: geopolitical instability predicts how much a linked ETF moves, not which way.

64.4%
Hit rate
0.23
Spearman IC
323
Events
PLATE · II
BACKTESTED
Project Caligula
SEC Footnote Ingestion
8-Pillar Scoring  ·  2014–2026

A research engine that parses unstructured SEC footnotes with Gemini AI to score proved reserves, hedging contract floors, and unit margins. The core bet: micro-fundamental quality signals predict stock outperformance.

8
Pillars
31
Tickers
12y
Backtested
PLATE · III
IN DEVELOPMENT
CHC Platform
Private Equity · Human Capital
Mid-Market Diligence

A diagnostic framework that scores workforce health for private equity targets before close. Five components, anchored to public records, normalised by sector. The output is a single composite score that goes into the data room alongside the financials.

5
Components
4
Risk tiers
72h
Turnaround
Methodology  ·  What our projects share

i. DisciplineBuild the number
before the narrative.

Most risk conversations in finance end with a qualitative opinion: a country rating, a management bio, an analyst note. Those outputs cannot be backtested, compared across deals, or revisited after the fact.

Our projects start from the same constraint: take messy, partial inputs and produce one bounded, sector-normalised score. The score is the deliverable. Everything around it is context.

Bounded score · Sector-normalised · Audit trail preserved

ii. HonestyPublish the
false positives.

Goldstein publishes a 29.2% false positive rate and a 252-day warm-up exclusion per region. CHC explicitly disclaims predictive ability and frames its output as a descriptive diagnostic.

None of our projects claim more than the data actually supports. A coefficient needs to clear statistical significance. A score needs a documented methodology and a pipeline someone else could reproduce.

Out-of-sample validation · Documented limitations · Reproducible
The score is the deliverable.
Everything around it is context.
— Method note · § 02
FOLIO · I — GOLDSTEIN GEOPOLITICAL VARIANCE SIGNAL
PNS · I Project Goldstein  ·  Variance, not direction

Geopolitical risk
is a variance input.
Model it[1] as one.

Conflict data moves volatility, not price. This project tests that idea across 12 global chokepoints — does real-time instability from GDELT and ACLED predict how much a linked sector ETF moves, rather than which direction?

View on GitHub
Research Approach

i. FrameA question worth
testing.

Most geopolitical risk tools end in a qualitative output: a country rating, an analyst note, a headline index. None of them produce a number you can backtest.

This project asks something narrower: does event-level conflict data predict variance increases in linked financial instruments? Not returns, not direction. Variance. That distinction determines whether the signal is practically useful or just intellectually interesting.

Across 12 chokepoints and four years of data, the answer is yes, with statistically significant coefficients and a hit rate that survives out-of-sample testing.

2022–2026 · Penn State University

ii. ConstructSignal construction
and validation.

The Geopolitical Risk Premium Score (GRPS) has three components: event-based instability from GDELT and ACLED, sector volatility premium versus rolling benchmarks, and VIX z-score conditioning to account for broader market fear.

Each region goes through a 252-day warm-up before any score gets issued. No regime label is published until that window closes. The false positive rate, 29.2%, is included in every summary; it is not buried.

The scoring engine itself is private. The data pipeline, fetchers, quality checks, and backtest framework are all open-source and documented.

github.com/prathamislit/Project-Goldstein · GDELT BigQuery · ACLED API
I. Variance, not returns
Geopolitical events
move volatility, not returns.

Across all 12 regions, γ coefficients for geo-to-variance range from 0.784 to 0.934. No significant return-prediction relationship showed up. The variance channel is the one that works.

γ ∈ [0.78, 0.93]
II. Out-of-sample
The signal holds
out of sample.

323 threshold-crossing events validated across a 21-day forward window. Average hit rate for vol exceeding the 75th percentile post-crossing: 64.4%. The signal measures a structural regime, not an inefficiency, so it does not decay on publication.

21-Day Forward Window
III. Proxy isolation
Chokepoint selection
determines proxy.

The ETF proxy has to isolate region-specific exposure. Early iterations used XLE for multiple regions and got mathematically identical signals. Each of the 12 final proxies was chosen to maximise independence of the geopolitical channel.

12 Independent Proxies
IV. Regime labels
Regime classification
is sufficient.

The continuous 0–100 score adds less information than the three-regime label (STABLE, ELEVATED, CRITICAL) for practical risk management. The regime label is the summary output; the score is the audit trail.

STABLE · ELEVATED · CRITICAL
V. VIX conditioning
VIX conditioning
matters.

Adding VIX z-score as a conditioning variable cut false positives by roughly 8 percentage points versus the unconditioned model. Broader market fear modulates how much local instability actually translates into sector variance.

−8pp False Positive
VI. Warm-up exclusion
Warm-up exclusion
is non-negotiable.

The first 252 trading days per region are excluded from all validation, no exceptions. Evaluating before enough data accumulates produces inflated coefficients. Published results only reflect the post-warmup window.

252-Day Hard Cut
Validated Results  ·  2022–2026
RegionETF Proxyγ (geo → variance)p-valueRationale
Middle EastXLE0.934< 0.001Energy sector — direct oil supply exposure
Eastern EuropeXME0.918< 0.001Metals & mining — commodity shock channel
Taiwan StraitSOXX0.897< 0.001Semiconductors — TSMC supply chain risk
Remaining 9 regions90-day post-warm-up validation window in progress
64.4%
Avg hit rate · vol > 75th pct
29.2%
Avg false positive rate
0.23
Avg Spearman IC
323
Validated crossing events
FOLIO · II — CALIGULA SEC FOOTNOTE QUALITY ENGINE
PNS · II Project Caligula  ·  Footnote ingestion

Structuring what
the models
leave out.[2]

Public financial screens skip what matters most — hedging cushions, proved reserve replacement lifespans, well-unit margins. By extracting SEC footnotes via Gemini AI, Caligula constructs a sector-normalised corporate quality index. Query a covered ticker from the 31-name study universe to build a footnote-driven deep dive.

Ingesting SEC Filings & Computing 8 Pillars…
E&P Research Study Universe  ·  Latest Rankings Ledger
Rank Ticker Tier Composite Unit Econ. Cap. Disc. Bal. Sheet Hedge Book Reserves Operational Sentiment Macro
Loading study rankings…

Universe normalised over weighted E&P sub-scores. Point-in-time dates enforced via EDGAR cutoff indexes. Click any row for the single-name deep dive.

Backtest Simulation Engine  ·  2014 Q1 → 2025 Q4

i. PremiseHistorical variance
and returns.

A 12-year quarterly point-in-time simulation, rebalancing every quarter. We buy the top quartile of highest-ranked corporate names (Long basket) and short the bottom quartile (Short basket) to verify the predictive risk-adjusted edge of the 8-pillar scoring engine.

ii. UniversePortfolio selection.

The simulation runs across two studies — the focused Permian Basin E&P Study and a diversified General Corporate Equities index. The Sharpe improvement is the experiment: does pulling qualitative detail out of footnotes actually move the risk-adjusted return?

Cumulative wealth path (1.0 = par)

Quarterly return breakdown

Quarterly Detail Ledger
Quarter Long Return Short Return L/S Net N (Long) N (Short)
Single-Name Diligence Ledger

Operator EOG Analysis

Energy  ·  Oil & Gas Exploration & Production  ·  Trailing LTM

Pillar coverage breakdown

Metrics ledger

Metric Value Score
Historical Quality Path

Diligence score evolution

FOLIO · III — CHC PLATFORM PRIVATE EQUITY · HUMAN CAPITAL DILIGENCE
PNS · III CHC Platform  ·  Private equity diligence

Finding the substance
the spreadsheet
leaves out.[3]

Human capital is usually the biggest driver of value in a buyout, and the least quantified input in the deal memo. The CHC Platform turns workforce risk signals into a composite score that goes into the data room alongside the financials, before close.

View on GitHub
The Problem  ·  Why Human Capital Fails Due Diligence

i. The GapWhere diligence
runs out of numbers.

PE due diligence is rigorous where data is structured: financials, legal, tax, customer concentration. But the moment you get to the people actually running the business, it turns qualitative and essentially unverifiable.

Consulting firms have written about this gap for decades. Two-thirds of post-merger integration failures trace back to culture and workforce issues, yet most deal memos give human capital fewer than two pages, usually just a management bio.

The problem is not that buyers don’t care about workforce health. It is that no structured, scoreable instrument exists to assess it. This framework is an attempt to build one.

McKinsey · Korn Ferry · Deloitte Insights · Harvard Business Review

ii. The TargetWhy private
mid-market?

Large-cap public targets have Glassdoor data, 10-K disclosures, analyst coverage, LinkedIn headcount signals. They are still hard to assess, but the signals at least exist.

Private companies with $50M–$500M EBITDA, the core of the middle market, have almost none of this. No EDGAR filings, no Glassdoor depth, no public attrition data. PE sponsors acquiring these companies are pricing workforce risk with essentially no structured input.

That is the segment this platform focuses on. The data architecture is built for the constraints of private, mid-market targets: it anchors on public-record signals and primary survey data collected during diligence, instead of relying on historical disclosures the target never made.

Mid-market: $50M–$500M EBITDA · ~$400–600B annual US PE volume (Pitchbook 2022–2024)
Framework  ·  The Five-Component HCV Index

The HCV Index is the quantitative core of the platform. It produces a single composite score, bounded 0 to 1, from five independent sub-components. Each component draws from a different data layer. Sector normalisation against published industry benchmarks is in progress; until it lands, scores are most meaningful within a single sector rather than across deals.

i. TDS
TDS
Tenure & Departure
Signal
Workforce seniority distribution and departure velocity from primary survey responses, weighted by role tier and functional criticality.
Dynamic Weight
ii. BDR
BDR
Business Disruption
Risk
Pulls in passive external signals (job posting velocity, WARN Act notices, OSHA citations) to build a market-observable disruption indicator. A WARN Act notice triggers a hard score floor.
Dynamic Weight
iii. SDS
SDS
Survey-Derived
Signal
Anonymised primary survey from a stratified sample of target employees during diligence. Twelve validated questions covering leadership trust, psychological safety, and intent to stay.
Dynamic Weight
iv. VGS
VGS
Velocity & Growth
Signal
Structural growth indicators: headcount trajectory, role expansion ratios, hiring pattern analysis, normalised by sector baseline and company stage.
Dynamic Weight
v. MCS
MCS
Market Comparator
Signal
Sector peer benchmarking through employment litigation records (PACER), executive departure events, and public peer-group data to put absolute scores in context.
Dynamic Weight

iii. OutputThe CHC Report

The platform produces a Cultural Health Certificate, a PDF that can sit in an M&A data room next to the financial and legal due diligence reports.

The report includes the composite HCV score, each sub-component with its contributing signals, a four-tier risk classification (GREEN / YELLOW / ORANGE / RED), sector-relative percentile, and a narrative flag section summarising material risks for the deal team.

The sell-side deployment model means the target company commissions the CHC during deal prep, typically 60-90 days before marketing, and includes it in the Virtual Data Room. Investment banks advising the process become the natural distribution channel.

Report: WeasyPrint / Jinja2 · Delivery: authenticated link · Turnaround: 72h from survey close

iv. LimitsWhat the score
does not do.

The HCV Index is a descriptive diagnostic, not a predictive model. It does not claim to forecast post-close turnover or EBITDA impact with precision.

The claim is narrower and more defensible: a structured, multi-signal composite score gives better diligence coverage than unstructured management interviews alone, and it produces a comparable, archivable record of the assessment.

Component weights draw on published practitioner literature (Korn Ferry, Deloitte Human Capital Trends, Mercer workforce risk indices) and are designed to be recalibrated as engagement data accumulates.

Not a financial instrument · Not investment advice · Score is informational input
Data Architecture  ·  Signal Sources
Primary · SurveySRC · 01
Employee
Survey
12-question survey deployed to a stratified employee sample via single-use UUID tokens. No PII stored. AES-256-GCM encrypted at rest.
Public Record · FederalSRC · 02
WARN
Act Notices
WARN Act filings: publicly reported mass layoff and plant closing notices. Triggers a hard BDR score floor when present.
Public Record · FederalSRC · 03
OSHA &
PACER Records
OSHA enforcement records and PACER federal court employment litigation dockets. Free public data with structured query access, no proprietary sources needed.
Passive · MarketSRC · 04
Job Posting
History
Historical job posting velocity and pattern analysis via Apify. Available for any company that has ever posted publicly; the main passive signal for private mid-market targets lacking Glassdoor depth.
Industry Impact  ·  The Size of the Gap
73%
M&A Deals Fail to Achieve Synergies
The fraction of M&A deals that fail to deliver projected value. Human capital and cultural misalignment are consistently cited as leading explanatory variables.
McKinsey & Company · post-merger integration meta-analysis
67%
Failures Attributed to People & Culture
Of deals that miss their value creation thesis, the majority cite human capital factors: management departure, cultural resistance, talent attrition.
Deloitte Human Capital M&A Survey · Korn Ferry M&A Research
<5%
Deal Memos with Structured HC Assessment
The estimated share of PE deal memos that include a structured, scoreable human capital assessment rather than a qualitative management bio. The instrument barely exists at scale.
HBR · Institutional Investor practitioner surveys
Framework Validation  ·  Design Properties
PropertyDesign TargetImplementationStatus
HCV Score Range0.0 – 1.0 boundedProprietary normalisation over weighted sub-score · Sector-adjusted✓ Bounded
Risk Classification4-tier: GREEN / YELLOW / ORANGE / REDProprietary monotone thresholds✓ Implemented
WARN Act Hard FloorBDR floor triggerProgrammatic override in BDR computation · Cannot be suppressed✓ Enforced
Survey AnonymisationNo PII stored at any pointUUID4 tokens · No name/email linkage · AES-256-GCM encrypted✓ By design
Single-use Survey TokensEach respondent token: one submissionToken invalidated on first use · Duplicate submission returns 400✓ Enforced
Authenticated EncryptionTamper detection on all stored dataAES-256-GCM · InvalidTag exception on any bit-flip · No silent corruption✓ GCM Mode
Sector NormalisationCross-sector comparabilityPublished benchmark means/stdevs (Manufacturing, Healthcare, SaaS/Tech, default)In Progress
Component Weight SourceGrounded in literatureKorn Ferry, Deloitte, Mercer practitioner research · Recalibration on engagement dataProposed
Private Company Data CoverageWorks without Glassdoor / 10-KJob posting history (Apify) · WARN Act · OSHA · PACER · Primary survey✓ Public-record anchored
Report DeliveryData-room ready PDF, 72-hr turnaroundWeasyPrint + Jinja2 pipeline · JWT-authenticated delivery linkIn Development

§ FinalResearch
correspondence.

Methodology questions, academic discussion, collaboration proposals, or requests for the full documentation and validation walkthrough. This is a research portfolio, and correspondence is welcome from anyone engaging seriously with the work — whether in academia, private equity, banking, or human capital consulting.

Direct correspondence
 
Subject lines: Project Goldstein for variance signal · Project Caligula for footnote quality engine · CHC Platform for HCV framework.
Author
Pratham N. Shah
Institution
Penn State University
Major
Data Science
Response
~48h, weekdays