Staff working on this national biomedical research program are regularly asked complex, highly technical questions by members of the public. These questions often touch on genetics, data privacy, research ethics, and medical terminology. To respond accurately, staff must not only remember the correct information but also interpret it, adapt it to context, and explain it clearly at a wide range of literacy levels. The existing training approach relied almost entirely on written resources, especially model responses to frequently asked questions.
When staff did practice, the feedback was often:
Yet the stakes were high. Responses must be:
In short, we needed a way to provide scalable, individualized feedback that helps staff learn to communicate complex information confidently and responsibly.
Because this project addressed a high-stakes communication skill, I began with a proof-of-concept to demonstrate that scalable, feedback-driven practice could work before investing in a full production build with a development team. The proof-of-concept was built entirely in Python using free, open-source tools: no licenses, custom infrastructure, or developer time were required at this stage. The interface was intentionally minimal (a question, a text field, and a feedback panel) to support a rapid learning loop.
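Hugging Face Spaces most often hosts small Python apps built with a lightweight UI library; the write-up does not name the one used, so the sketch below assumes Gradio and a placeholder `score_response` function purely to illustrate how spare the interface was: one question, one text field, one feedback panel.

```python
import gradio as gr

# Placeholder scoring function; the real pipeline (semantic comparison,
# feedback generation, and phrase checks) is described in the sections below.
def score_response(learner_answer: str) -> str:
    return "Similarity: 82% (High)\nFeedback: ..."

demo = gr.Interface(
    fn=score_response,
    inputs=gr.Textbox(
        label="Your response",
        lines=6,
        placeholder="How would you answer this participant question?",
    ),
    outputs=gr.Textbox(label="Feedback"),
    title="Practice question (sample)",
)

demo.launch()
```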
In the production training environment, staff learn to respond using IRB-approved model answers. These internal answers are not public, but the All of Us website publishes public-facing FAQs based on the same source guidance. These public FAQs are accurate, written in plain language, and aligned in tone and intent with the internal training materials. For the proof-of-concept, I adapted these public FAQs as gold-standard benchmark answers for comparison. This ensured the demo respected IRB boundaries while still reflecting the communication approach expected in training.
The demo ran on Hugging Face Spaces and used an open embedding model to generate semantic vector representations of (1) the learner’s written response and (2) the benchmark answer. The tool calculated cosine similarity between these vectors to assess how closely the meaning of the learner’s response aligned with the meaning of the approved content. This approach evaluates conceptual accuracy, not keyword overlap. The similarity score was then mapped to a percentage scale and a qualitative label (e.g., “Low,” “Moderate,” “High”) to make the results meaningful to the learner.
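A minimal sketch of this scoring step, assuming the sentence-transformers library and the open `all-MiniLM-L6-v2` checkpoint; the exact embedding model and the Low/Moderate/High cut points used in the prototype are not specified above, so those values are illustrative only.

```python
from sentence_transformers import SentenceTransformer, util

# Illustrative open embedding model (an assumption, not the prototype's exact checkpoint).
model = SentenceTransformer("all-MiniLM-L6-v2")

def semantic_score(learner_answer: str, benchmark_answer: str) -> tuple[int, str]:
    """Embed both texts and compare their meaning via cosine similarity."""
    embeddings = model.encode([learner_answer, benchmark_answer], convert_to_tensor=True)
    similarity = util.cos_sim(embeddings[0], embeddings[1]).item()

    # Map the raw cosine similarity to a percentage and a qualitative label.
    # The cut points below are illustrative, not the production thresholds.
    percent = round(max(similarity, 0.0) * 100)
    if percent >= 80:
        label = "High"
    elif percent >= 60:
        label = "Moderate"
    else:
        label = "Low"
    return percent, label
```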
In addition to scoring similarity, the tool used a lightweight language model to compare the learner's response to the benchmark answer and identify differences in meaning. The tool then generated feedback text highlighting ways the learner's answer could be improved.
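The specific lightweight model is not named above, so the sketch below assumes a small, freely available instruction-tuned model (`google/flan-t5-base`) served through the Hugging Face `transformers` pipeline; the prompt wording is hypothetical and stands in for the feedback prompts authored for the prototype.

```python
from transformers import pipeline

# A small, freely available instruction-tuned model stands in for the
# "lightweight language model"; the model actually used is not specified here.
generator = pipeline("text2text-generation", model="google/flan-t5-base")

def generate_feedback(learner_answer: str, benchmark_answer: str) -> str:
    """Ask the model to contrast the two answers and suggest improvements."""
    prompt = (
        "Compare the learner's answer to the approved reference answer. "
        "List key ideas that are missing, unclear, or inaccurate in the learner's answer.\n\n"
        f"Reference answer: {benchmark_answer}\n\n"
        f"Learner's answer: {learner_answer}\n\nFeedback:"
    )
    result = generator(prompt, max_new_tokens=150)
    return result[0]["generated_text"]
```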
The training program also uses an Approved Language Framework to help staff communicate in ways that are accurate, welcoming, and aligned with the values of the research program. Some commonly used phrases can unintentionally imply incorrect assumptions, introduce ambiguity, or make community members feel excluded. To support this, the prototype included a rule-based layer that scanned the learner's responses for a representative subset of these phrases using regular-expression matching. If a flagged phrase appeared in the learner’s response, the tool provided brief, pre-written guidance explaining the reasons for avoiding the phrase and alternative wording.
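A minimal sketch of this rule-based layer follows; the flagged phrases and guidance text are placeholders, since the Approved Language Framework itself is internal and is not reproduced here.

```python
import re

# Placeholder phrases and guidance; the real Approved Language Framework
# entries are internal and these examples are illustrative only.
FLAGGED_PHRASES = {
    r"\bsubjects?\b": (
        "Refer to people who join the program as 'participants,' not 'subjects,' "
        "to emphasize partnership rather than passive involvement."
    ),
    r"\bnormal\s+DNA\b": (
        "Avoid framing any DNA as 'normal'; describe genetic variants in neutral terms instead."
    ),
}

def check_language(response: str) -> list[str]:
    """Return pre-written guidance for any flagged phrase found in the response."""
    guidance = []
    for pattern, advice in FLAGGED_PHRASES.items():
        if re.search(pattern, response, flags=re.IGNORECASE):
            guidance.append(advice)
    return guidance
```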
I led the end-to-end design and development of this proof-of-concept. I identified the instructional problem, evaluated solution approaches, and designed the learning experience around applied practice with timely, meaningful feedback. I selected the semantic similarity approach, adapted the benchmark responses from publicly available IRB-approved FAQs, and authored the feedback prompts and scoring logic to ensure the system reinforced clarity, accuracy, and trust.
I built the prototype interface and scoring pipeline using Python and free, open-source models running on Hugging Face Spaces, allowing the solution to be tested without custom infrastructure or engineering support. I also designed the rule-based layer that checked for phrases addressed in the program’s Approved Language Framework, and drafted the explanatory guidance used to redirect learners toward clearer and more inclusive alternatives.
This proof-of-concept demonstrated that meaningful, individualized feedback can be automated responsibly, without sacrificing accuracy or lowering instructional quality. It provided a validated foundation for discussions within the research program about building a production-ready tool integrated directly into the training platform.
This interactive demo lets you try the proof-of-concept tool yourself. Type a response to the sample question and submit it. The system will compare your answer to an approved reference answer similar to those used in production training. You'll receive a score based on semantic similarity (not keyword matching) and suggestions for improvement. Further instructions for other features are within the demo itself.

If the demo does not finish loading below (some browsers block embedded applications), you can open the demo on Hugging Face.