IWork

Selected work

A narrow selection of systems built for structure, coordination, and evaluation.

The page stays specific by design. It is not meant to be exhaustive.

Live instrument

Data interpretation studio

A restrained prototype for reading messy tables. It checks structure first, then offers a small number of comparisons worth attention.

More work

Further entries from the same body of work. Shorter treatment here, but not secondary in importance.

  • 02
    Personal knowledge system

    A maintained index for work, notes, references, and selected images — built as a long-form personal system rather than a portfolio shell.

    This site is designed less as a showcase and more as a maintained personal system. Work, notes, archive material, and static identity pages sit inside separate sectional logics so different kinds of material can accumulate without flattening into the same presentation style.

    The structure favors continuity over novelty. New entries are meant to be filed, placed, and revisited rather than endlessly pushed downward in feed form. Typography, spacing, navigation, and page rhythm are tuned so the site can hold analytical writing, visual fragments, project descriptions, and slower personal material without collapsing into a single undifferentiated voice.

    The system is intentionally quiet. Instead of adding visible product behavior everywhere, small page-specific details are used to create recognition and continuity: restrained motion, margin objects, sectional pacing, and surfaces that feel editorial rather than app-like.

    In that sense, the site functions as both interface and index: a place to present selected work, but also a framework for storing, extending, and shaping an ongoing body of material over time.

    Personal system
  • 03
    Model stress-testing and evaluation design

    STEM-grounded prompts, rubrics, and scoring structures built to make failure visible and comparable.

    Quantitative and STEM-grounded evaluation tasks written to expose where strong models still break under constraint: hallucination under pressure, brittle reasoning, weak instruction-following, and instability when answers had to remain verifiable rather than merely fluent.

    The work is adversarial only in a disciplined sense. Difficulty is used to separate surface fluency from sustained correctness, with ambiguity controlled so failures remain diagnostic. Rubrics keep those failures comparable across runs and tied to reproducibility.

    Evaluation work