CoinWorld News reports that Anthropic has released BioMysteryBench, a bioinformatics benchmark of 99 questions. Domain experts wrote the questions from real datasets (DNA/RNA sequencing, proteomics, metabolomics, and others), and each answer is derived from objective properties of the data or from experimentally verified metadata rather than from researchers' subjective judgment. In the evaluation, Claude Mythos achieved a 30% solve rate on the 23 questions classed as difficult for humans. The test environment gives Claude a container with commonly used bioinformatics tools pre-installed, along with access to public databases for downloading reference genomes.
