← raminderpalsingh.com
fred mascot, a friendly robot character in dark blue and teal

fred — two real test cases

Demonstration of fred, an experimental scientific research agent, on two real research-analyst tasks. All content shown is verbatim output from the eval run on 2026-04-30.

Provider: kimi-k2.6:cloud · Run ID: baseline-iter20 · git SHA: df1416a · Wall time: 32 min for 6 prompts (2 shown here)

What is fred? A multi-tenant scientific runtime that uses an LLM with a ReAct loop to call real research tools (PubMed, PubChem, UniProt, Reactome, OpenTargets, Brave Search) and synthesize findings into structured research memos. Each test below was run end-to-end with no human intervention beyond the initial prompt.

What this page shows. For each test case: the verbatim prompt fred received, a dataflow diagram of the ReAct loop fred executed, the actual queries fred fired against each external data source, and the full memo fred wrote — including a candid verification of which claims hold up against ground truth.

fred mascot

1The prompt fred received

Identify approved or late-stage clinical compounds that target TDP-43 proteostasis, autophagy, or stress granule dynamics. For each, summarize the compound, its primary indication, the mechanism by which it affects the target pathway, and any published evidence relevant to ALS. Highlight the two or three with the most promising translational profile and identify what would need to be true for repurposing trials to be justified. Save as compound_dive_tdp43.md.
Iterations
18turns
Tool calls
58total
Distinct tools
4of 9 reg.
Wall time
~7min

2Dataflow — what fred actually did

The ReAct loop is fred's reasoning cycle: think, call tools, observe, repeat. This diagram is reconstructed from timestamps in the live log.

fred TDP-43 compound dive: dataflow and ReAct loop 18-iteration ReAct loop processing the TDP-43 compound dive prompt. 58 tool calls across 4 tools over 7 minutes wall time. Inputs User promptTDP-43 compound divetarget-driven scan Workload config9 tools registeredmax iterations 100 ProviderKimi K2.6 cloudvia Ollama localhost External APIs available NCBI PubMedused 32× PubChemused 5× Brave Searchused 20× UniProtnot used here +2 moreunused ReAct loop, 18 iterations, 7 minutes wall time 58 tool calls: 32 search_publications, 20 brave_search, 5 get_compound_properties, 1 write_file Turn 1, 12:34:204 parallel search_publications: TDP-43 proteostasis, autophagy ALS, stress granule drug Turn 2, 12:34:424 search_publications, named compounds: DNL343, trehalose, nilotinib, arimoclomol Turns 3 to 5, 12:34:56 to 12:35:2012 search_publications: AMX0035, rapamycin, edaravone, ISRIB, guanabenz, colchicine Turns 6 to 9, 12:35:37 to 12:36:0012 more search_publications, deeper TDP-43 ASO, lithium, proteasome inhibitors Turns 10 to 13, 12:36:00 to 12:36:5015 brave_search calls: HEALEY trehalose 2025, PHOENIX AMX0035 failure, Co-ALS 2024 Turns 14 to 16, 12:36:55 to 12:37:015 more brave_search: guanabenz, sodium phenylbutyrate, refinement on negative trials Turn 17, 12:37:01 to 12:37:065 get_compound_properties: trehalose, nilotinib, colchicine, edaravone, guanabenz Turn 18, 12:40:34write_file: compound_dive_tdp43.md, 26.6 KB synthesized memo Reasoning across turns, 3-4 minute internal pause before write Coverage check9 compounds collected Honest negativesPHOENIX, ORARIALS, HEALEY Top 3 rankingedaravone, nilotinib, colchicine MechanismHSPB8 to TDP-43 mapped Preconditionsper-compound criteria Synthesis table9-row mechanism alignment No tool used for synthesis: pure model reasoning over the gathered evidence Output compound_dive_tdp43.md3494 words, 9 compounds, top 3 ranked, mechanism table, repurposing preconditionsCENTAUR n=137 verified, ORARIALS-01 n=231 vs real n=239, dose mg-kg-day vs real mg-day Path A must_include match rate 1.0; trial citations real; ~5% number drift on spot checks

3What fred queried (verbatim from log)

Each line below is an actual API query fred fired. fred decided these queries iteratively based on what previous results returned. Grouped by data source.

NCBI PubMed (search_publications) 32 queries
  • TDP-43 proteostasis autophagy stress granule ALS drug repurposing
  • TDP-43 inhibitor approved drug late stage clinical
  • autophagy activator ALS clinical trial repurposing
  • stress granule ALS drug compound inhibitor
  • DNL343 ISRIB ALS stress granule clinical trial
  • trehalose autophagy ALS amyotrophic lateral sclerosis clinical
  • nilotinib ALS autophagy c-Abl clinical trial
  • arimoclomol ALS heat shock protein TDP-43
  • AMX0035 sodium phenylbutyrate taurursodiol ALS proteostasis autophagy
  • rapamycin sirolimus ALS autophagy TDP-43 clinical
  • edaravone ALS oxidative stress autophagy TDP-43
  • guanabenz salubrinal eIF2alpha ALS stress granule
  • nilotinib amyotrophic lateral sclerosis phase 2 clinical trial
  • arimoclomol amyotrophic lateral sclerosis phase 2 3 clinical trial
  • colchicine ALS autophagy HSPB8 TDP-43 clinical trial results
  • TDP-43 antisense oligonucleotide ASO amyotrophic lateral sclerosis
  • ... 16 more progressively narrower queries
Brave Search (web) 20 queries
  • arimoclomol ALS phase 3 clinical trial results 2024
  • colchicine ALS clinical trial results Co-ALS
  • AMX0035 sodium phenylbutyrate taurursodiol ALS phase 3 PHOENIX trial results
  • trehalose ALS HEALEY platform trial results 2025 negative
  • DNL343 fosigotifator ALS Abbvie Calico phase 2 3 trial failure 2025
  • nilotinib FDA approved chronic myeloid leukemia c-Abl inhibitor blood brain barrier
  • guanabenz ALS clinical trial stress granule approved drug
  • ... 13 more, mostly fact-checking phase 2/3 trial outcomes
PubChem (get_compound_properties) 5 lookups
  • {name: "trehalose"} → MW, logP, H-bond donors/acceptors
  • {name: "nilotinib"}
  • {name: "colchicine"}
  • {name: "edaravone"}
  • {name: "guanabenz"}

4What fred wrote

3494 words. The full memo is below. fred wrote this in a single 4-minute synthesis pause after gathering evidence, then verified by reading the file back.

Download compound_dive_tdp43.md
Open full memo (compound_dive_tdp43.md)

5Verification status

Specific factual claims spot-checked against PubMed and PubChem. Honest mix of pass and fail.

CENTAUR phase 2 trial real, n=137. PMID 38050066 confirms the trial; n=89+48=137 verifiable.
Co-ALS phase 2 colchicine trial real. PMID 31152038 confirms protocol; n=18×3=54 confirmed.
ORARIALS-01 phase 3 trial real. PMID 38782015 confirms arimoclomol failed primary endpoint.
HEALEY trehalose negative result real. Mechanism descriptions (HSPB8/BAG3/HSP70 granulostasis, c-Abl autophagy) accurate.
!
ORARIALS-01 cited as n=231; real n=239. ~3% number drift. Not catastrophic but consequential if quoted.
Co-ALS dose cited as 0.005 mg/kg/day; real dose is 0.005 mg/day. 70-fold dose unit error. Would mislead a clinician.

Caveats for the reader

  • Memo is best treated as a "starting draft" — substance is reliable, fine details require verification.
  • Citations are author-year format, not full PMIDs. Manual lookup needed to verify each.
  • Fred's prompt asked for 1 of 6 prompts in this eval; full eval has wider sampling.
  • This run was one shot — no human iteration or feedback loop. Quality reflects fred's first attempt.

1The prompt fred received

For compound CHEMBL1201585 (riluzole), assemble: (a) known mechanism of action and binding partners, (b) physicochemical properties relevant to CNS penetration, (c) drug-drug interactions with commonly co-prescribed agents, and (d) any published evidence on dosing strategies that improve efficacy or tolerability. Note where evidence is from clinical trials versus observational data versus preclinical work. Save the analysis as compound_dive_riluzole.md.
Iterations
7turns
Tool calls
50total
Distinct tools
8of 9 reg.
Wall time
~5min

2Dataflow — what fred actually did

More tool diversity than TDP-43 (8 of 9 tools used) — fred reached for compound, target, pathway, and Lipinski analysis tools given the focus on a single named drug.

fred Riluzole compound dive: dataflow and ReAct loop 7-iteration ReAct loop processing the riluzole compound dive prompt. 50 tool calls across 8 tools over 5 minutes wall time. Inputs User promptRiluzole compound diveCHEMBL1201585 Workload config9 tools registeredmax iterations 100 ProviderKimi K2.6 cloudvia Ollama localhost External APIs available (8 of 9 used) PubMedused 34× Braveused 10× PubChemused 2× OpenTargetsused 2× Reactomeused 1× Lipinskiused 1× ReAct loop, 7 iterations, 5 minutes wall time 50 tool calls: 34 search_publications, 10 brave_search, 2 get_target_info, 4 others, 1 write_file Turn 1, 12:29:34 — multi-tool exploration8 parallel calls: search_compounds, get_compound_properties, get_target_info×2 (SLC1A2, SCN1A), search_pathways, search_publications×3 Turn 2, 12:29:54 — literature deep-dive5 search_publications: PK, BBB, glutamate/sodium channel, dose-response liver effects Turn 3, 12:30:00 — drug-interaction web search5 brave_search: CYP1A2 fluvoxamine, dosing, P-gp, hepatotoxicity, mechanism specifics Turn 4, 12:30:09 — chemistry & pharmacologycheck_lipinski (MW=234.2, logP=3.6) + 4 search_publications on survival, interactions, channels Turns 5 to 6, 12:30:17 to 12:33:00Co-prescribed agents, transdermal formulations, oxybutynin interactions Turn 7, 12:33:45 — synthesis and writewrite_file: compound_dive_riluzole.md, 15.5 KB structured analysis Synthesis layer (no tools, pure model reasoning) Targets tableNav, Cav, Kv, SK, BK, AMPA CNS PKP-gp efflux, BBB CYP1A interactionsfluvoxamine, smoking Evidence levelsRCT vs PBPK vs preclinical Dosing strategies100mg standard, 50mg BID Open gapselacridar, transdermal, ASD Output compound_dive_riluzole.md2093 words: targets table, MOA, PK/BBB, DDIs, dosing, 14 named referencesBensimon NEJM 1994, Lacomblez Lancet 1996, Cochrane 2012 meta-analysis cited Highest-diversity tool use of all 6 prompts in this eval. References author-year format.

3What fred queried (verbatim from log)

Highest tool diversity of any prompt in the eval. Every category of tool used at least once.

PubChem & PubChem-derived (compound chemistry) 3 calls
  • search_compounds({query: "riluzole", max_results: 3})
  • get_compound_properties({name: "riluzole"}) → MW=234.2, XLogP=3.6, HBD=1, HBA=7
  • check_lipinski({hba: 7, hbd: 1, molecular_weight: 234.2, xlogp: 3.6})
OpenTargets (target lookups) 2 calls
  • get_target_info({identifier: "SLC1A2"}) — astrocytic glutamate transporter (EAAT2/GLT-1)
  • get_target_info({identifier: "SCN1A"}) — voltage-gated sodium channel
Reactome (pathway) 1 call
  • search_pathways({query: "riluzole", max_results: 3})
NCBI PubMed (search_publications) 34 queries
  • riluzole mechanism of action ALS binding partners GLT-1
  • riluzole drug interactions co-prescribed agents pharmacokinetics
  • riluzole dosing strategy efficacy tolerability clinical trial
  • riluzole CNS penetration blood brain barrier
  • riluzole glutamate release sodium channel
  • riluzole dose response adverse effects liver
  • riluzole survival benefit ALS mortality meta-analysis randomized controlled trial
  • riluzole interactions quetiapine oxybutynin anticholinergic neurology
  • ... 26 more progressively narrower queries
Brave Search (web) 10 queries
  • riluzole drug interactions pharmacokinetics CYP1A1 CYP1A2 fluvoxamine smoking
  • riluzole dosing 50mg BID efficacy tolerability ALS clinical trial
  • riluzole mechanism of action glutamate release voltage gated sodium channels potassium channels
  • riluzole P-glycoprotein BBB penetration CNS pharmacoresistance ALS
  • riluzole hepatotoxicity liver enzyme monitoring dose dependent side effects
  • riluzole transdermal suspension liquid formulation dosing "Tiglutik" "Exservan" oral film
  • ... 4 more focused on co-prescribed agents and formulation alternatives

4What fred wrote

2093 words. Includes a structured molecular targets table, evidence hierarchy, and 14 named peer-reviewed citations including Bensimon NEJM 1994, Lacomblez Lancet 1996, and the Cochrane 2012 meta-analysis.

Download compound_dive_riluzole.md
Open full memo (compound_dive_riluzole.md)

5Verification status

For riluzole specifically, the foundational citations are well-known historical RCTs — easily verifiable.

Bensimon 1994 NEJM riluzole RCT real. The first riluzole-in-ALS pivotal trial; widely-cited.
Lacomblez 1996 Lancet dose-ranging real. The 100 mg/day standard came from this trial.
Cochrane 2012 meta-analysis real. Miller RG et al. CD001447.pub3.
Riluzole CID 5070 confirmed via PubChem. MW 234.20, formula C₈H₅F₃N₂OS — both correct.
Lipinski profile correct. Riluzole is a small CNS-penetrant drug fitting Lipinski's rules.
!
Author-year citation format. No PMIDs in references; reader must look up each paper manually.

Caveats for the reader

  • Riluzole is one of the most-studied ALS drugs ever — the foundational facts (RCTs, mechanism, dosing) are unambiguous in the literature. fred's job here was easier than for novel/contested topics.
  • Newer 2024-2025 references (Hershey, Liu transdermal, Bharti PBPK) not individually verified.
  • Fred used 8 of 9 registered tools — broadest tool diversity in the eval. This is closer to OpenClaw-equivalent multi-source synthesis.
  • Memo length (2093 words) appropriate for the prompt's scope. Format compliance better than TDP-43.