Back to Sessions

AI4RA Workshop | REACH 2026

The intersection between AI and data science

Led by Nathan Wiggins

Module structure

Foundations Module Preview

Part 1

AI4RA Introduction

10 min

Part 2

Data Science & AI Intersection

10 min

Part 3

Sandbox Time

20 min

Part 4

Decision Frameworks

10 min

Part 1 · AI4RA Introduction

AI4RA Objectives

1. Develop open-source data models and workflows

2. Create trustworthy AI-powered tools

3. Implement and assess the impact of these tools at partner institutions

Part 1 · AI4RA Introduction

Core Pillars for AI4RA

Accuracy

Evidence-based answers grounded in validated data sources.

Reproducibility

Repeatable workflows with transparent prompts, data, and methods.

Flexibility

Support for multiple institutional contexts, teams, and use cases.

Security

Policy-aligned controls for access, privacy, and operational risk.

Part 1 · AI4RA Introduction

Community of Practice Ecosystem

Vandalizer

AI workflow interface for structured analysis and reproducible outputs.

Data Lakehouse

Shared institutional data foundation to support trusted analytics at scale.

AI Literacy Course

Frameworks and companions for building institutional AI fluency across research administration.

Other Projects

Additional initiatives that extend AI4RA methods across research administration.

Part 1 · AI4RA Introduction

AI4RA: The Bridge to Research Analytics

Harnessing Data Science and Artificial Intelligence to:

Eliminate barriers that prevent institutions from performing meaningful strategic analysis
Manage growing expectations among stakeholders and institutional leadership
Prioritize consistent data governance and management principles

Part 2 · Data Science & AI Intersection

FAIR Principles as Operational Guardrails

Use persistent identifiers and searchable metadata.

Provide clear retrieval pathways with governed permissions.

Adopt shared formats and vocabularies across systems.

Document provenance, context, and usage constraints.

FAIR source: Wilkinson, M. D., et al. (2016). “The FAIR Guiding Principles for scientific data management and stewardship.” Scientific Data, 3, 160018. Graphic: Cloud-SPAN.

Part 2 · Data Science & AI Intersection

The Intersection of Data Science and Artificial Intelligence

Shift 1

Proactive Data Science

A Shift in Our Reach

Agentic coding and development tools make building new systems attainable in a short period of time.

Traditional data science backgrounds are well-aligned with advances in artificial intelligence.

Endless opportunities and development trees can lead to “Brain Fry.”

Brain Fry citation: Harvard Business Review. (March 2026). “When Using AI Leads to Brain Fry.”

Part 2 · Data Science & AI Intersection

The Intersection of Data Science and Artificial Intelligence

Shift 2

Reactive Data Science

A Shift in Stakeholder Values

Stakeholders now expect faster, more effective solutions powered by AI.

Perceived competition and fear of getting left behind drives initiatives.

Less appeal to dashboards, more appeal to chatbot interfaces.

Part 2 · Data Science & AI Intersection

Build a sentence one token at a time

Click the next word. Each choice shifts what comes next, just like an LLM.

No right answer, only probabilities. This is why LLMs produce different output each time.

Part 2 · Data Science & AI Intersection

Simplify the Approach

Step 1

Identify a black hole

Spot well-defined tasks that are creating unnecessarily large bottlenecks.

Step 2

Classify the task

Can data science or artificial intelligence be used to automate or collaborate?

Step 3

Build a simple solution

The solution must be less intimidating and complex than the original task.

Part 3 · Sandbox Time

Stepping Into the Vandalizer

Jump straight into the University of Idaho’s AI prompt workspace.

Launch Application

Open Vandalizer

Prompt Playground UI Research Support Fast Iteration

Part 3 · Sandbox Time

Exercise 1: Simple Extraction

Sample NCOD NOFO

Grant solicitation for the National Coalition of Donuts

Challenges

Create a list of 3-5 extraction terms and run the extraction

Automatically generate a list of extraction terms from the document and run the extraction

Download the results of the extraction

Part 3 · Sandbox Time

Exercise 2: Repeated Extractions

Sample NSF Award 1

NSF award to PI Radagast Brownleaf

Sample NSF Award 2

NSF award to PI Juniper Quillstone

Sample NSF Award 3

NSF award to PI RJ MacReady

Challenges

Run the pre-built NSF extraction process

Repeat the process for each PDF

Add an extraction term that you know isn't in the document

Part 3 · Sandbox Time

Exercise 3: Multi-Document Interactions

Sample Budget

Grant proposal budget for Frodo B. Underhill

Sample Budget Justification

Grant proposal budget justification for Frodo B. Underhill

Sample Research Strategy

Grant proposal Research Strategy for Frodo B. Underhill

Challenges

Using a prompt, compare the budget against the budget justification

Identify the inaccuracy

Add the Research Strategy document to the comparison and check for alignment

Part 4 · Decision Frameworks

Who owns AI-generated content?

US Copyright Office: AI-generated content without significant human authorship is not copyrightable
“Meaningful human contribution” is the standard — but the line is still being drawn
If AI writes your proposal narrative and you submit it unchanged, you may not own it
Training data raises separate IP questions: models learn from copyrighted material, but the legal landscape is unsettled

For research administrators

AI-assisted drafts need substantial human editing to establish authorship
AI-generated figures and data visualizations are especially murky
Institutional IP policies may not yet address AI — flag this gap
Sponsor terms of award may add additional constraints

Part 4 · Decision Frameworks

Does it matter how the work got done?

AI Appropriateness Continuum — process-critical to output-driven

Human leads — Every step must be auditable. AI may assist, but a human owns the reasoning.
Human in the loop — AI prepares, a qualified person reviews and decides.
AI leads — Only the result matters. AI can lead; validate with spot-checks.

Part 4 · Decision Frameworks

Classify these analytic tasks

Drag each slider. What do you think?

Faculty Workload Modeling

5.0

ROI Analysis

5.0

Financial Projections

5.0

Compliance Cycle-Time

5.0

Impact Reporting

5.0

Competition Intelligence Dashboards

5.0

Human Leads Human in the Loop AI Leads

Part 4 · Decision Frameworks

Data Science Tasks vs. AI Tasks: Then and Now

BEFORE

Careful distinction between tasks that are well-suited for “data science work” or “AI work.”

AFTER

Agentic workflows call specialized tools that are tailored to the target task

TRUE PRINCIPLE

Task fit matters for optimized results from artificial intelligence

Part 4 · Decision Frameworks

From Prompt Engineering to Intent Engineering

BEFORE

Significant emphasis placed on crafting effective prompts to optimize the output

AFTER

Better processes reduce the need for perfecting prompts with large, mainstream models

TRUE PRINCIPLE

Prompt design helps users get output that aligns with their intent

Part 4 · Decision Frameworks

Four disciplines for AI interaction

Your AI has a page limit — the context window

10% Intent

50% Information

25% Instructions

15% Conversation

Intent

Telosa

Information

Mnemos

Instructions

Promptulus

Conversation

Dialogos

Every AI model has a context window — a fixed limit on how much it can read at once. All four disciplines compete for that space. Getting the balance right is the skill.

Part 4 · Decision Frameworks

The AI Literacy Companions

Sequita

Auditability

Modulus

Decomposition

Telosa

Intent

Promptulus

Prompts

Mnemos

Context

Dialogos

Conversation

Vitrea

Transparency

Veridex

Evaluation

Clarion

Reporting

nate-layman.github.io/promptulus

Closing

Amid Constant Change, Data Foundations Stay Essential

✅ Quality

Validated, timely, and complete records.

⚖️ Governance

Clear ownership, policy alignment, and accountability.

🔐 Security

Access controls, monitoring, and incident response plans.

📝 Documentation

Lineage, assumptions, and known limitations.

10 Minute Break

Coming Next: The data lakehouse and data organization

As we explore how to do powerful things with your data foundation, start considering the potential you want to unlock with your data.

Return to session list

The intersection between AI and data science

Foundations Module Preview

AI4RA Introduction

Data Science & AI Intersection

Sandbox Time

Decision Frameworks

AI4RA Objectives

1. Develop open-source data models and workflows

2. Create trustworthy AI-powered tools

3. Implement and assess the impact of these tools at partner institutions

Core Pillars for AI4RA

Accuracy

Reproducibility

Flexibility

Security

Community of Practice Ecosystem

Vandalizer

Data Lakehouse

AI Literacy Course

Other Projects

AI4RA: The Bridge to Research Analytics

FAIR Principles as Operational Guardrails

The Intersection of Data Science and Artificial Intelligence

Proactive Data Science

A Shift in Our Reach

The Intersection of Data Science and Artificial Intelligence

Reactive Data Science

A Shift in Stakeholder Values

Build a sentence one token at a time

Simplify the Approach

Identify a black hole

Classify the task

Build a simple solution

Stepping Into the Vandalizer

Exercise 1: Simple Extraction

Sample NCOD NOFO

Challenges

Exercise 2: Repeated Extractions

Sample NSF Award 1

Sample NSF Award 2

Sample NSF Award 3

Challenges

Exercise 3: Multi-Document Interactions

Sample Budget

Sample Budget Justification

Sample Research Strategy

Challenges

Who owns AI-generated content?

Research Analytic Accountability

Does it matter how the work got done?

Classify these analytic tasks

Data Science Tasks vs. AI Tasks: Then and Now

From Prompt Engineering to Intent Engineering

Four disciplines for AI interaction

The AI Literacy Companions

Amid Constant Change, Data Foundations Stay Essential

✅ Quality

⚖️ Governance

🔐 Security

📝 Documentation

Coming Next: The data lakehouse and data organization