Selected Work · 2026

Eleven projects.
Five deployed, two killed on their own bars, two papers,
one engine underneath.

I design AI systems against the friction points of real operations, informed by a career across sales, technical sales, and field application engineering in solar and energy storage, from C-suite to field installer, five-figure to seven-figure deals. I'm not a trained engineer or programmer; that turned out to be the angle that produced the work below. The portfolio includes a governed support-knowledge console deployed in daily use inside a multinational manufacturer's support organization, a complete beta-program architecture for a new residential product family, a document-intelligence engine in production with paying clients, a 74-module measurement engine with 43 USPTO provisional filings, two working research papers (IBTR / TRIL and DALD), a live consumer reference app, a cross-domain falsification that closed on null, an inverted-architecture solar vertical that reached the same data-access ceiling that limits the broader engine, an EU AI Act reference architecture killed within a day of a self-commissioned competitive review, and an operating California advisory practice where AI is auditable production infrastructure under a published Terms of Service and attorney-reviewed engagement agreement.

USPTO Provisionals Filed

1.0 / 0.0

Retrieval Hit / False-Confidence Rate, Deployed Support Console

3/3

Projects Closed on Their Own Bar or Structural Ceiling

Legal-Doc Drift Catches in Audit

§ 01 · Thesis

I build operational tools and the primitives underneath them.
And I instrument both so I can kill what fails.

The work below runs in two layers. The first is deployed operational tooling: a governed support-knowledge console in daily use inside a support organization, a beta-program architecture running a live hardware program, and a document-intelligence engine with paying clients. The second is the measurement substrate underneath: primitives, attestation interfaces, and open standards, tested for whether they transfer across domains. Each layer feeds the other; the primitives earned their claims by shipping against them.

The portfolio is structured around falsifiability. Each entry declares a bar. Several clear it; two close cleanly on failing to clear it; one closes against a structural ceiling that limits the broader engine itself. A commercially credible design can still fail as a business, a sound architecture can still lose to a free incumbent, and clearing synthetic-substrate tests says nothing about real-data validation. The closures are featured rather than buried, because a portfolio that only shows wins demonstrates selection, and I want it to demonstrate judgment.

I'm not a trained engineer or a trained programmer. The discipline of AI architecture is roughly twelve months old, and nobody has been trained for it yet. Autodidacts working from operational reality hold a window of advantage, and the window is open now.

A career across sales, technical sales, and field application engineering, C-suite to field installer, five-figure to seven-figure deals. That built the cognitive substrate the AI work draws from: pattern recognition across modalities, liability instinct, cross-system reasoning, and the habit of operating in the gap between what a customer says and what's actually wrong. Operating exposure inside five multinationals with markedly different norms around risk, hierarchy, and contract precision. Tigo Energy through IPO preparation, present in executive decision-making during the run-up, watching the legal, compliance, and liability apparatus get built around an operating business in real time.

Broad architecture funds vertical execution. PIE is sensor-agnostic, domain-agnostic, worldwide-applicable, and stands as a substrate. Every commercial extraction came from narrowing: FretMind, WindPIE, SolPIE, Cardinal, The Installer's View, The Installer's Lens. Narrow versions moved faster and reached defensible architectural conclusions the broad version never could. Both layers are necessary, because broad investment generates the substrate that vertical execution consumes. All six extractions inherit the same structural ceiling: synthetic substrates and public data can take work to defensible architectural conclusions, but real predictive validation requires institutional data access that solo investigation cannot obtain. Applied to AI: frontier capability is broad; products that survive pick a vertical and execute hard while capability stays general.

AI was used throughout as a structured tutor and architect rather than a code generator. Frontier LLMs walked me through unfamiliar territory step by step while I executed the actual work, with active pushback when something didn't fit. The work shipped at the same speed it would have anyway, and the skill compounded instead of being outsourced.

I trust my intuition because it has been trained against consequences in the field; whether the work below earns that trust is the reader's call.

A note on the meta-work. Several entries below include a section titled "what I did not do." Those are real disclosures. They identify pressure tests a more rigorous pass would have included but didn't. If a hiring manager wants to know how I assess my own work, that section is the answer.

§ 02 · Featured Work

Eleven projects, deployed work first.

Deployed & Operational

D-01 · Deployed Support Infrastructure Deployed · In Daily Operational Use

P-10 · Governed Support-Knowledge Console

A single-file offline support console in daily use by a multinational manufacturer's customer support and engineering team, covering a residential solar and storage product line. The interface is a compiled artifact; the asset underneath is a governed, citation-complete, machine-readable knowledge corpus.

Architecture

All content lives as structured JSON: components, fault codes, procedures, compliance facts, configurations, Q&A entries, cross-links, and search aliases. A deterministic Python assembler compiles that data into a dependency-free HTML file that runs from a file share with no network calls. The same input always produces the same md5, and every build passes a battery of checks before it ships. An authored concept and intent router sits in front of a scored literal matcher with alias mapping and typo tolerance; the build-side audit tooling and the shipped JavaScript are parity-tested against each other, so the test harness and the product provably rank results the same way.

Governance

344 per-fact citation bindings, each resolving to a page image of its source document; builds fail hard on any stale or unknown reference. Attribution carried as data: every fact is tagged document-sourced, verified field knowledge, field knowledge supplementing a document, or industry background, with a precedence model governing conflicts (field experience leads on troubleshooting procedure; documents are absolute on specs, limits, and warranty conditions). Gated ingestion with regression baselines covering search relevance, link integrity, reachability, terminology, and an enforced contact whitelist. Coverage gaps are declared instead of hidden.

The AI variant, measured

The deterministic search router doubles as the retriever, and the model synthesizes answers under a system prompt built from the tool's own trust rules: answer only from the provided material, never affirm or deny warranty coverage, escalate when the material is thin. In its first measured evaluation it scored a 1.0 retrieval hit rate and a 0.0 false-confidence rate on the curated test battery; every out-of-scope and wrong-product probe was declined instead of answered. The corpus ships with its own evaluation infrastructure, a generated bank of 1,466 field-realistic test questions tagged with expected answers.

What I did not do

The AI variant is measured against the curated battery, and has not yet been deployed to end users. Usage telemetry and a maintainer correction loop belong to the hosted phase of the roadmap, along with retrieval-augmented answering in production.

344 citation bindings Deterministic compiled builds 1,466-question eval bank 1.0 / 0.0 measured

The Hook

The pipeline is product-agnostic by construction, and product identity flows entirely from data: a second product line is onboarding from the same engine with zero assembler changes. Most retrieval projects inherit duplication, stale revisions, and unsourced claims from chunked PDFs; a retrieval layer built on this corpus inherits grounding, citations, and confidence semantics from day one.

P-08 · Analytical Platform · In Production In Production · Paying Clients

The Installer's Lens · Solar Advisory Analytics Platform

The analytical engine underneath The Installer's View, in production and powering paid homeowner advisory deliverables: a Python-based multi-source verification platform that automates the labor-intensive solar-proposal review workflow.

The problem

California homeowners reviewing rooftop solar proposals receive economic projections that frequently assume legacy NEM 2.0 economics, despite the state having transitioned to NEM 3.0 / NBT in April 2023, under which export credits fell roughly 75% on average. Independent verification has historically required either paying a competing installer for a counter-bid (same incentive bias) or hiring an engineer ($1,500+, weeks of turnaround). The Installer's Lens automates the verification workflow at a fraction of the cost and turnaround.

Architecture

Single-user Streamlit UI wrapping a Python backend that parses unstructured intake (utility bills via vision-model OCR, Green Button XML, panel photos, installer proposal PDFs), orchestrates parallelized API queries across 16 authoritative public data sources, runs a 10-lens analytical framework, and generates client deliverables via template-driven PDF rendering with full audit trail. Sources span solar resource modeling (NREL NSRDB, PVWatts, PySAM), roof geometry (Google Solar API), equipment validation (CEC), installer verification (CSLB, CFPB, CourtListener, SEC EDGAR), environmental context (EPA AirNow, CAL FIRE, CPUC PSPS), and utility-specific NBT rate schedules.

What I specifically architected

Sole architect and product owner, directing full implementation through an LLM coding agent under written standing instructions, with human approval gating each merge; every architectural decision, data model, and release passed a structured multi-role design review before build. Safety and integrity are structural: fail-closed vision-based PII redaction on every document format before any data leaves the machine, a six-gate intake validation system, copy-fidelity canaries that fail the build if rendered prose drifts from approved text, and a principal-verification seam that makes it mechanically impossible for machine-drafted judgments to reach a client without human approval. Verified by a 1,200+ test suite with typed contracts and in-code API budget guards. Local-first storage (SQLite + per-engagement file structure) for audit defensibility. Fifth extraction of the PIE primitives: baseline-measurement intelligence applied to system-level economic verification.

What I have not yet done

Third-party end-to-end accuracy review of client deliverables has not been commissioned; verification to date is internal (the 1,200+ test suite, typed contracts, and principal review of every delivered analysis). Load behavior under high engagement volume and adversarial prompt-injection resistance on intake parsing remain untested at scale.

Python · Streamlit · SQLite 16 public data sources Vision-model OCR + parsing 10-lens analytical framework Fifth extraction of PIE primitives

The Hook

The working example of AI-augmented analytical labor at the cost structure that makes independent solar advisory economically viable, with real paying clients. The previous alternative was a $1,500+ engineering report on a $25,000 system decision: rare and slow. What ships now is AI-orchestrated multi-source verification with a full audit trail and principal verification on every analytical judgment, delivered by one domain expert treating AI as a governed engineering team.

P-07 · Active Business · Parallel Practice Live · Operating Practice

The Installer's View · theinstallersview.com

An independent California solar advisory practice for residential homeowners, live at theinstallersview.com: a cookieless, zero-tracking publication with 12 long-form consumer-protection articles at launch and a weekly cadence, operated solo on top of a full-time engineering career.

Architecture

The site launched on a conventional small-business stack (managed hosting, a commercial theme builder, the usual retinue of plugins) and was then fully replatformed: the live site today is hand-coded on the Astro static site generator with a custom CSS design system, no theme, no page builder, and no plugins, version-controlled on GitHub and continuously deployed to Cloudflare Pages, with the DNS migration executed live on an operating domain without dropping business email. The rebuild took the stack from a dynamic PHP application with a database and login surface to pre-rendered static files behind a global CDN: sub-second page loads, green Core Web Vitals, zero visitor cookies (no consent banner required because nothing is tracked), and minimal attack surface. Frontier LLMs handle content synthesis, document review, and operational reasoning against vendor criteria enforced contractually: no training on client data, encryption in transit and at rest, time-bounded retention. Three specialized AI workstreams (site and editorial, engine, social) coordinate through one canonical operating record: an append-only, source-tagged decision log past 1,150 entries with read-before-write conflict protocol. Every article passes a documented production discipline: a per-article research pass anchored to primary sources, a factual-claims audit log with per-claim re-verification schedules, and a locked voice specification with mechanical screening for AI-generated prose patterns; a site-wide register audit eliminated 284 punctuation and phrasing tells in a single reviewed sweep. Professional liability and cyber coverage carried; the practice is structured so that AI-produced work product is defensible to clients, regulators, and counsel. Payments, banking, insurance, and trademark all stood up solo.

What I specifically architected

Position B (AI in standard production, principal verifies all judgments) encoded consistently across Privacy Policy, Terms of Service, Engagement Agreement, and service-page FAQ. No-named-AI-tools rule in public materials: vendor selection criteria are durable; vendor names will churn. Nine-dimension content audit run against every public-facing draft. Editorial firewall against reviewing proposals from any installer TIV has consulted to within a defined recent window. Four-pillar service architecture with explicit "what TIV cannot do" scope.

Pressure-testing

Trademark / prior-art vetting pre-commit. Formal legal-docs drift audit against v1 Privacy Policy and Terms identified eight material drifts (sole proprietorship → LLC, Position A → Position B, a payment-processor change, voice register, AI use disclosure missing) and v2 drafts produced with changelog. Engagement Agreement routed for attorney review with directed-attention notes to Limitation of Liability (CA Civil Code §1668), Indemnification reasonableness, and Governing Law. Affiliate-bias falsifiability test caught and rewrote my own original About-page draft as false once affiliate revenue was on the roadmap.

The build method

I direct AI systems the way a general contractor directs trades: specify in plain language, let the model draft the artifact (code, copy, contract, design), review against domain knowledge, and approve or redline before anything ships. Site code is written by AI agents working inside the repository under a written standing-instructions file that encodes the brand system and the hard rules (no deploy without human approval, no invented copy, no tracking, legal pages locked), so each session starts already knowing the house rules. I review diffs, run the build, and push from a Linux terminal; a git push is the entire deployment pipeline. The same pattern governed the replatform itself: the migration plan, the DNS cutover sequence, the new codebase, and the post-launch audits were all AI-drafted, human-verified, and shipped in reviewable increments.

What I did not do

Did not run formal cross-LLM arbitration on the legal or content artifacts; a single frontier model was the primary synthesis LLM throughout. Running the Privacy Policy, Terms, and Engagement Agreement through a second model with a structured critique prompt, independent of attorney review, remains a meaningful addition.

Astro + Cloudflare Pages · zero cookies Position B AI integration Attorney-reviewed Master Engagement 1,150+ entry decision log

The Hook

A regulated professional advisory practice where AI is production infrastructure and the architecture is constrained by professional liability insurance, unauthorized-practice risk, and CCPA disclosure law. The governing question throughout: what architectural guardrails are required so AI-produced work product can be defended to clients, lawyers, and regulators?

D-02 · Deployed Program Architecture Deployed · Live Hardware Program

Beta Program Architecture · Residential Product Family NPI

The complete documentation and process architecture for beta testing of a new residential product family (AC microinverter, AC combiner, and energy storage), covering site qualification through closeout, built inside a multinational manufacturer's field applications organization.

Deliverables

An FAE-scoped SOP with on-site capture requirements, a pass/fail site qualification system, a program workbook serving as system of record (site tracker, live hardware roll-up, structured form build specifications), Voice of Customer instruments for installers and homeowners, a per-site process record, and an illustrated field photo guide. Legacy tracking modernized from an 80-column tracker to 36 purposeful fields, with static pivot reporting replaced by live formulas.

Method

All documents produced through AI-assisted programmatic generation (Node.js docx, Python openpyxl, vector illustration pipelines), with iterative direction, domain review, and final technical judgment retained by the principal. Weeks of drafting compressed into days, with cross-document alignment enforced by the pipeline itself, including identical checklist content between the SOP appendix and the live workbook tool.

Site qualification → closeout VoC instruments, installer + homeowner 80 → 36 field tracker modernization Programmatic doc generation

The Hook

Program architecture treated as a compiled product: consistent styling and cross-document alignment enforced by the generation pipeline rather than by proofreading, on a live hardware program with real installers and homeowners in the loop.

P-03 · Live Reference Implementation Live · Personal Practice Tool

FretMind · The Engine in Production

The consumer reference app that proved PIE works in a live capture loop. The meta-artifact on which AI collaboration patterns produced novel work versus runaway scope.

Architecture

Browser-based: Web Audio API for real-time pitch and rhythm, MediaPipe Hands for body mechanics, Basic Pitch (Spotify) for audio-to-MIDI. Five stateful coaching personas route to different LLM prompting and feedback styles. Welford-based individual baselines with z-score classification against the player's own history. DALD monitors divergence between AI-claimed session quality and the player's independently-measured improvement trajectory. VTACA detects breath-holding patterns during cognitive load. PBITE gates interventions on practice-quality signal.

What I specifically architected

Persona system as a structural product decision: each persona routes to different prompting and recommendation libraries. April 2026 triage: 22 modules KEEP for consumer use, 10 SIMPLIFY, 30 REMOVE (retained in reference repo), 6 REFERENCE-ONLY. Shipped the hand-written browser port first rather than waiting for full-engine integration, prioritizing real sessions over architectural purity.

Pressure-testing

Two live practice sessions in April 2026 with real audio capture, verifying VTACA breath-hold detection and PBITE intervention gating worked in production rather than in simulation. Caught mic-clipping (sessions 1–6 corrupted due to laptop mic proximity), traced to input sensitivity rather than engine failure. Forced the distinction between "commercially credible" and "commercially viable" and admitted FretMind achieved the first but not the second. None of PIE's core functional claims (quality score validity, PBITE gating, DALD, BAIV, CAAD, AICV) have been empirically validated on real deployment data.

Self-disclosed limitation

The currently-live demo is a hand-written browser port of three modules (Baseline, FlowDetector, PQTracker), not the full 74-module engine. PIE Clip and PIE Band hardware prototypes are designed and parts purchased; build is sequenced behind TIV launch.

Web Audio API · MediaPipe Basic Pitch (Spotify OSS) 5 coaching personas 2 live sessions logged

The Hook

FretMind is the worked example that documents which prompt patterns produced novel work versus runaway scope expansion. The technical work is real. The meta-work is what makes FretMind useful: recognizing how the AI conversation itself shaped the architecture, often unhelpfully, and naming the conversational dynamics that caused it.

Research & Primitives

P-01 · Engine + Open Standard Actively Developed

PIE · Predictive Individual Engine

A sensor-agnostic measurement layer that scores any system's real-time state against its own historical baseline. Humans, hardware, AI systems, field-deployed sensors. Never against a population.

Architecture

74-module JavaScript core (v8.32.0, ~971 KB, SHA-pinned) wrapping six statistical primitives: Welford incremental moments, Bayesian Online Changepoint Detection, Normal-Inverse-Gamma baselining, hierarchical shared-factor arrays, weighted heteroscedastic baselines, and distribution-fitting under AIC / BIC / tail-weighted criteria. Five-layer stack: Engine → HSAPI open-standard query interface → reference app (FretMind) → vertical adapters → AI infrastructure. Specialized modules: DALD (deceptive-alignment divergence), VTACA (vitals-triggered cognitive-assessment), PBITE (practice-quality intervention gating), GRS (ground-truth labeling), RETRO (retroactive baseline construction).

What I specifically architected

Individual-baseline-only commitment: every module computes against Welford moments of one entity's history; no population priors anywhere. Moat ordering: dataset > hardware > implementation > network effects > patents, patents fifth on purpose. HSAPI as an open standard with a published patent pledge, licensable implementation underneath. Identity Separation Rule (Validator §45) keeping processor, scope, and obligation distinguishable in every output.

Pressure-testing

27 experiments pre-registered with SHA-256 hashes before any run script existed. Six honest losses to specialist baselines (Omori, ARIMA, others) documented rather than buried. Failed experiments forced API redesigns: AIC bias produced the DFB criterion API; BOCPD startup artifacts produced CPB burn-in routing; null-saturation produced three first-class null procedures. Comprehensive trademark, prior-art, and IP vetting across the engine; pulled USPTO PAIR export to verify the 43 provisionals were on file, caught an entity-status error and an RFC 3161 temporal-attestation gap. OSS license audit and SBOM confirmed no GPL / AGPL contamination.

What I did not do, and why

Cross-LLM arbitration on PIE's architectural claims was deliberately not performed: exposing claim language to other LLM providers was judged unacceptable IP-exposure risk during the provisional window. UPL / liability analysis held for the commercialization stage.

43 USPTO provisionals filed JavaScript · 74 modules Pre-registered experiment scaffold Open standard + patent pledge

The Hook

The same statistical substrate runs unchanged across guitar timing, wind turbine condition, surgical pressure, sensor drift, AI-output coherence, and seismic state. The DALD module applies it directly to deployed-AI monitoring: detecting when an AI's claimed effect diverges from independently-measured trajectory, the structural signature of deceptive alignment, without model-internals access. The engine is built to be falsified.

P-02 · Working Paper · Foundational Draft May 2026 · Available on Request

IBTR / TRIL · Individual-Baseline Truth Infrastructure

The foundational paper underneath the seven other projects in this portfolio: population reference is categorically wrong for individual-divergence questions, and one self-referential measurement substrate operates across domains as different as music, surgery, AI alignment, industrial machinery, and seismic monitoring.

The architectural claim

Most measurement systems compare a subject to a population. This works for triage, selection, and placement. It fails for a different class: has this specific subject changed? Is the gap between what this subject claims and what this subject does growing? For those questions the subject is the reference, not the population. IBTR is the methodological proposal for what first-class infrastructural support for self-reference looks like.

Two signals: stated capability (what the subject claims) and demonstrated capability (what independent measurement records). Signed divergence accumulates as a per-subject baseline via Welford's online algorithm (O(1) memory, O(1) update; no need to store raw observations). After enough observations the baseline locks; new observations score against it via standard z-thresholds. The math is decades old; the contribution is the architectural commitment, not the equations.

Cross-domain instantiation

Five worked-through domains in the paper share the same Welford accumulator and same z-classifier; only the sensor pipeline is domain-specific: music performance (intent vs. execution), surgical performance (procedural plan vs. motor control), AI alignment (model self-report vs. AI-affected reality), industrial machinery (reported state vs. sensor measurement), and geophysical monitoring (model prediction vs. measurement). One measurement substrate, five domains. The empirical content of the domain-agnostic claim.

What this is, what this is not

A conceptual and methodological paper, not an empirical one. Empirical validation is limited to one domain (instrumental music, in FretMind). The cross-domain claim is an architectural prediction backed by worked-through instantiations rather than completed empirical work. The framework produces signal rather than diagnosis: a high z-score means the divergence is statistically unusual for this subject; it does not identify fatigue or illness or deception. The conceptual framework is offered to the research community; no claim is made on the framing itself.

Welford + z-score classification Five-domain instantiation Foundational paper Companion to DALD (P-09)

P-04 · Cross-Domain Falsification Closed Cleanly on Null

WindPIE · The Engine in Wind

A narrowed extraction of PIE primitives applied to wind turbine fleet analytics. The hypothesis was that per-turbine individual baselines would beat fleet-mean comparison at surfacing subtle degradation. The data said otherwise.

Architecture

Python module on a primitives stack: Welford running statistics, multi-scale rolling-window baselines, bin-keyed conditional accumulators, per-turbine integrated baseline, CUSUM changepoint detector. Wind-domain layer using pvlib for atmospherics, IEC 61400 density correction, turbulence intensity, sector classification, wake-affected flagging. Validation harness ran against the CARE-to-Compare labeled dataset (Wind Farm A, 22 labeled events).

What I specifically architected

IBTR commitment (individual baseline only, no population comparison in the primary detection path) against the easier fleet-mean path. Pre-registration discipline applied to my own validation, as strictly as to claims I would make to others. SHA-256-hashed pre-registration locked hypothesis and decision rules before analysis. Held-out test split locked before any peek. Direct comparison against fleet-mean (peer) detection treating the IBTR architectural premise as the thing on trial.

What I did not do

No commercial due diligence (trademark, licensing, competitive audit, UPL); out of scope for methodology validation. Not claiming it.

Python · pvlib · CUSUM IEC 61400 density correction CARE-to-Compare dataset Pre-registered · SHA-256 locked

The Hook

I killed it cleanly. The SHA-256-locked pre-registration defined what null meant before I saw results. When results came in null (peer-first detection beating individual-baseline detection by roughly 5× on the labeled events) the closure was procedural rather than negotiable. Most portfolios showcase what someone shipped. This one is the discipline that made an unfavorable validation unambiguous, and the willingness to close a project I had architected because predictive capability was the bar I set and the evidence didn't clear it.

P-05 · Inverted-Architecture Vertical Mostly Killed · Data-Access Ceiling

SolPIE · PIE in the Sun

Built after WindPIE returned null on the original individual-baseline-first thesis. SolPIE inverted the architecture in response: peer-first detection, environmental conditioning instead of output conditioning, variance shift disabled by default. Engine and synthetic substrate built; integration test passes; real-data validation blocked by the same data-access ceiling that limits PIE itself.

Architecture

Python engine for module-level power electronics (MLPE) telemetry built on pvlib. Three-layer detection stack: PeerComparator (MAD-based robust z-scores) as primary detector, ConditionalImprovedBaseline (per-module conditional baselines on irradiance × ambient temperature) for characterization, CoherenceBaseline for multivariate joint divergence across voltage/current/power. ImprovedHybridBaseline composes NIG conjugate Bayesian inference, Welford master statistics, RobustEWMV with outlier clipping. Failure-mode profiles (soiling, cell crack, inverter capacitor aging) encode solar-domain knowledge as first-class configuration. Synthetic substrate built independently using pvlib solar geometry and Sandia cell temperature models, with failure signatures parameterized from PV reliability literature.

What I specifically architected

Peer-first detection as the primary architecture: reversing PIE's original individual-baseline commitment based on direct evidence from WindPIE that peer comparison won 5×. Environmental conditioning instead of output conditioning, to avoid the cascade-detection problem where any output-affecting failure makes other metrics look anomalous because their baselines were conditioned on the now-shifted output. Structural separation between synthetic generator and failure-mode profiles: both parameterized from PV reliability literature independently, never coupled, specifically to prevent closed-loop validation where synthetic data is tuned to make the detector look correct. Variance shift detection disabled by default despite being in the primitives, re-enabled per profile only with explicit justification.

What I did not do

No trademark or prior-art search. No competitive audit of existing MLPE analytics vendors. No regulatory review. Real-data validation impossible: vendor-grade per-module telemetry is not available to solo investigators at the resolution required to test the architecture against the soiling, cell crack, and inverter capacitor aging signatures the engine is designed to detect. Three unresolved engine issues are documented in source: per-module tolerance handling in peer comparison, uniform coherence_signal firing in the integration test, and variance_decay rule status for the capacitor profile.

Python · pvlib · MAD z-scores NIG conjugate Bayesian Physics-grounded synthetic substrate Architecture inverted from WindPIE null

The Hook

The discipline is in the response, not the kill. WindPIE refuted the original architectural thesis. SolPIE was built by taking those negative findings seriously and inverting the architecture: peer-first instead of individual-baseline-first, environmental conditioning instead of output conditioning, variance shift disabled by default. The inverted architecture passed integration on synthetic substrate at 4.5× event ratio on a soiling profile. Real-data validation is structurally blocked: vendor-grade MLPE telemetry isn't available to solo investigators. SolPIE sits at the same ceiling PIE itself sits at: public datasets and synthetic substrates can take the work to defensible architectural conclusions, but real predictive validation requires institutional data access.

P-06 · EU AI Act Compliance Killed on Competitive Landscape · May 8, 2026

Cardinal · EU AI Act Reference Architecture

A reference architecture for EU AI Act compliance infrastructure, scoped as documented specifications plus illustrative TypeScript implementation. Six articles in scope; three explicitly excluded as discipline. Killed within ~24 hours of a self-commissioned competitive ultrareview.

Architecture and scope

Reference architecture, not finished product. Documentation is the primary deliverable; code illustrates the documentation. Licensees take the architecture and build their own production systems against it. EU AI Act Articles 5, 9, 10, 12, 13, and 14 in scope across six functional module groups (core primitives, governance, audit, monitoring, oversight, transparency). Articles 11, 15, 17 explicitly excluded: Articles 11 and 17 as different problem space; Article 15 as different discipline and vendor category. Exclusions documented as deliberate boundaries.

What I specifically architected

Reference-architecture positioning: inverting the typical documentation-to-code ratio to match what sophisticated EU AI Act buyers actually want during the Goldilocks period. Three-reference claim discipline: every architectural claim required specification, illustrative implementation, and tests before any external use; claims without all three were removed. Multi-path commercial strategy preserved rather than committed prematurely: direct license, audit firm partnership, strategic partnership, acquisition. Grant-back clause: any licensee improvements grant back non-exclusively. IP integration with 43 USPTO provisional filings.

Pressure-testing: the discipline that produced the kill

After the May 7, 2026 foundation session produced full handoff documentation, scope, commercial strategy, and three identified warm EU contacts, the next gate before any outreach was a competitive landscape ultrareview I commissioned myself. It surfaced Microsoft Agent Governance Toolkit (AGT), released MIT-licensed and free on April 2, 2026, addressing substantially overlapping ground. I killed Cardinal on or around May 8, 2026, within ~24 hours of the discovery. No EU contacts were ever approached. The discipline rule the kill produced: competitive landscape ultrareview is a gate before commercial commitment, not a checkbox after sophisticated build.

EU AI Act Articles 5 / 9 / 10 / 12 / 13 / 14 Articles 11 / 15 / 17 excluded as discipline Grant-back licensing Three-reference claim discipline Killed within 24 hrs of ultrareview

The Hook

Cardinal's value as a portfolio artifact is not the architecture. It's the discipline of commissioning the competitive ultrareview as the gate before commercial outreach, accepting what it returned, and killing the project within 24 hours. The architecture had reached sophisticated commercial-ready state. That's what I killed. "We built something good" is not a reason to launch into a market a major platform may have already covered.

P-09 · Working Paper · Alignment Research Draft May 2026 · Available on Request

DALD · Deceptive Alignment Detection via Behavioral Baseline Trajectory Analysis

The alignment-specific application of IBTR / TRIL: detecting deceptive alignment in deployed AI systems without requiring model internals access.

The gap DALD addresses

State-of-the-art deception detection (mechanistic interpretability, activation-based methods such as LAT and POSER, self-monitoring architectures) shows 89–98% accuracy in controlled settings but all require model internals access, training infrastructure, or controlled evaluation environments. None work for the increasingly common case: a deployed AI accessed through an API, with no access to weights, where what we want to know is whether this specific deployment is producing genuine helpfulness or has converged toward a satisfying-but-not-helping pattern with this specific user over time.

The method

Per-user baselines of the divergence between AI-stated outcomes (c, what the AI claims about its effect) and independently-measured demonstrated outcomes (d, what actually happened). Sustained anomalous drift (high stated outcomes paired with flat or declining demonstrated trajectories) produces a MISALIGNED classification. The signature cannot be faked by the AI alone. The AI controls c; the AI does not control d. The user's baseline trajectory accumulates from observations made independently of the AI's presence. The math is the IBTR substrate (Welford accumulators, z-score thresholds); the architectural commitment is that alignment evaluation can be read from the human the AI affected, without needing access to the AI itself.

Positioning

DALD does not compete with mechanistic interpretability: interpretability cannot operate where the model is not accessible; DALD can. Conversely, interpretability identifies the specific circuit; DALD cannot. The two are complementary. Activation-based methods are vulnerable to recent prompt-level adversarial defeat (Daniels et al. February 2026, reducing auditor accuracy from 100% to 1–3% while auditor confidence stayed high). Trajectory-level signals require coordinated long-horizon manipulation across a specific user's history, a substantially harder optimization target than single-output adversarial defeat.

The worked example

DALD was first implemented in FretMind (P-03). AI guitar coaching system; c is the AI's claimed session quality; d is the user's independently-measured timing, pitch, and rhythm accuracy from the audio analysis pipeline against the score's reference signal, measured without consulting the AI's claim. Per-user baselines accumulate across sessions. The detection signature is high mean c paired with flat or declining baseline trajectory of d. The classical sycophancy failure mode mapped to a coaching context.

What this is, what this is not

A method rather than an algorithm. The contribution is conceptual rather than algorithmic. Empirical validation is preliminary: one domain, small scale. Cross-domain and at-scale validation is needed. The framework does not address catastrophic single-action misalignment: by the time a trajectory drifts, the harm has occurred. DALD is for slow drift; sudden failure is out of scope. Vulnerable to baseline poisoning; mitigations specified (changepoint detection, burn-in periods, hierarchical baselines) but not yet empirically validated against capable adversaries.

Welford + z-score classification No model internals access required Complements interpretability Trajectory-level (harder to spoof) Companion to IBTR / TRIL (P-02)

The Hook

An entire class of alignment failures can be detected from external observations of an AI system's effect on the people it acts on, without ever opening the box. The method is buildable now: no new mathematics, no new infrastructure, no model access required. The barrier is institutional rather than technical: agreeing that longitudinal individual monitoring is part of alignment evaluation infrastructure. The conceptual framing is offered to the research community for whatever use is found for it.

§ 03 · Capabilities

The skills the projects actually exercise.

C-01

Primitive design and cross-domain transfer

Build statistical and architectural primitives that compose. Test transfer across domains as an explicit hypothesis. Six vertical extractions of the PIE primitives: consumer skill measurement, wind turbine analytics, solar power electronics analytics, EU AI Act compliance, California solar advisory practice, and solar proposal verification.

C-02

AI output evaluation and adversarial arbitrage

Treat AI output as a hypothesis to be tested, not an answer to be accepted. Cross-LLM arbitration, fabrication catches, explicit refusal to claim diligence not performed.

C-03

Retrieval, evaluation, and agent governance

Citation-bound knowledge corpora with deterministic compilation and declared coverage gaps. Ground-truth evaluation banks with measured retrieval and false-confidence rates. LLM coding agents operated under written standing instructions with human approval gating every merge. Multi-section, role-defined prompts with built-in constraints and explicit failure modes; legal-doc drift audits with changelogs.

C-04

Liability-aware AI integration

Architect AI usage so the work product is defensible to clients, regulators, and counsel; structured against an attorney-reviewed engagement contract.

C-05

IP-aware architecture decisions

43 USPTO provisionals filed April 2026 (IDs 64036946–64039431, expire April 2027). Moat ordering (dataset, hardware, implementation, network effects, patents) applied as a design discipline.

C-06

Operational depth across roles and deal sizes

A career across sales, technical sales, and field application engineering: C-suite to field installer, five figures to seven. The pattern recognition, liability instinct, and cross-system reasoning the AI work is built on.

§ 04 · Hiring

If you're hiring for the seam between
AI capability and operational reality,

I'm the profile that doesn't show up in a standard candidate pipeline. A senior operating record across sales, technical sales, and field application engineering, the last several months of intensive AI building from a standing start, a support-knowledge console deployed in daily use inside a manufacturer's support organization, a document-intelligence engine in production with paying clients, a 74-module measurement engine with 43 USPTO provisional filings, two working research papers on alignment-relevant measurement infrastructure, two projects killed cleanly on bars I set in advance, an inverted-architecture solar vertical (SolPIE) built in direct response to what the wind work refuted, and an operating California advisory practice where AI is auditable production infrastructure under a published Terms of Service and attorney-reviewed engagement agreement.

Not a trained engineer or programmer. That's the point. The work above is what someone with my background builds when AI removes the bottleneck that would otherwise have required hiring an engineering team. And the discipline to instrument, falsify, and kill the work cleanly was already there, built by a career of carrying responsibility for outcomes in front of customers, lawyers, and regulators.

Primary target: Product Support / Customer Support Engineering Management roles at AI companies, where deep customer-support operations expertise and substantive AI literacy combine. The lateral move from solar industry customer support into AI customer support is deliberate: same function, adjacent domain, with demonstrated rapid AI adoption.

Sales-led roles: Enterprise Sales, Strategic Account Executive, Business Development, and Director-level commercial roles at AI and AI-infrastructure companies. A senior commercial track record (territory growth from $250K to $15M+, first to $1M quarter, first to $1M month, Director of Sales) paired with the AI portfolio above is a rare combination in the AI hiring pool.

Technical and customer-facing roles: Solutions Engineering, Forward-Deployed Engineering, Technical Sales Engineering, and Technical Product Manager at AI and AI-infrastructure companies. Also open to founding Solutions Architect or founding Product roles at AI startups under twenty people, where the seat pairs deep AI understanding with operational instinct and customer-facing credibility, and engineers own the implementation.

Get in touch Download CV

Generic version. Role-specific resume variants are submitted directly to each application.

Eleven projects. Five deployed, two killed on their own bars, two papers, one engine underneath.

I build operational tools and the primitives underneath them.And I instrument both so I can kill what fails.

Eleven projects, deployed work first.

P-10 · Governed Support-Knowledge Console

The Installer's Lens · Solar Advisory Analytics Platform

The Installer's View · theinstallersview.com

Beta Program Architecture · Residential Product Family NPI

FretMind · The Engine in Production

PIE · Predictive Individual Engine

IBTR / TRIL · Individual-Baseline Truth Infrastructure

WindPIE · The Engine in Wind

SolPIE · PIE in the Sun

Cardinal · EU AI Act Reference Architecture

DALD · Deceptive Alignment Detection via Behavioral Baseline Trajectory Analysis

The skills the projects actually exercise.

If you're hiring for the seam betweenAI capability and operational reality,

Eleven projects.
Five deployed, two killed on their own bars, two papers,
one engine underneath.

I build operational tools and the primitives underneath them.
And I instrument both so I can kill what fails.

If you're hiring for the seam between
AI capability and operational reality,