Skip to content

Example Prompts

Here are 40 prompts to explore what Valency can do. Do you have other interesting examples? Feel free to share with us at feedback.valency.io.

  1. Map a gravitational wave researcher’s universe — who is D. Holz, what do they work on, and who do they collaborate with?

  2. Who are the top 5 most prolific researchers in quantum physics? Compare their profiles side-by-side — where do their interests overlap and diverge?

  3. Map Nima Arkani-Hamed’s research network. Who are his top collaborators, and how do their research interests compare?

  4. Who are the leading researchers in condensed matter nanoscale physics, and where do their interests diverge — superconductivity, topology, spintronics?

  5. Map the ATLAS collaboration at CERN. Who are the real collaborators once you filter out the mega-author-list noise?

  1. Find Google’s “Quantum Supremacy” paper, show me its revision history, and find the most similar work.

  2. I just read arXiv paper 2301.07041. What’s it about, how has it evolved, and what’s the most related work?

  3. I know the “Attention Is All You Need” paper. Find it, then show me the 10 most similar papers — just titles, no abstracts.

  1. Compare monthly publication volume in cs.CL, cs.LG, cs.AI, stat.ML, and q-fin.ST from January 2022 onwards. Which fields exploded after ChatGPT?

  2. Compare publication volume in astro-ph.CO, gr-qc, and hep-th over time. Is dark energy research growing or plateauing?

  3. Compare the AI safety research landscape across cs.AI, cs.CL, cs.LG, cs.MA, cs.CY, and stat.ML since 2020. Which fields are growing fastest?

  4. Compare publication trends in quant-ph, cond-mat.mes-hall, and physics.chem-ph from 2018 to 2025 at yearly granularity. Are quantum-adjacent fields growing in lockstep?

  1. Find papers at the intersection of quantum computing and portfolio optimization. They might be in q-fin.CP, q-fin.ST, or quant-ph — search across all of them.

  2. Find papers about applying reinforcement learning to protein folding — they could be in cs.LG, cs.AI, q-bio.BM, or physics.bio-ph.

  3. Find papers about using machine learning for astronomical transient detection. Search both semantically and by venue in MNRAS.

  1. What journals actually publish papers in hep-ex? Find experimental particle physics papers published in Physical Review.

  2. What journals publish quantitative finance papers? Can I find q-fin.ST papers in NeurIPS or the Journal of Financial Economics?

  1. How many CC-BY and CC0 papers exist in astro-ph.GA? Export a sample as CSV with just IDs, titles, and dates.

  2. I’m building a CC-BY dataset of NLP papers for model fine-tuning. How many are available, and can I get a sample export?

  3. Find all CC0-licensed papers in math.CO, count them, export the full set as CSV, and tell me which venues they were published in.

  1. What are the biggest research areas on bioRxiv and medRxiv? Show me the top domains on each and highlight where they overlap — e.g., do both servers have significant immunology or genomics communities?

  2. Track the COVID-19 research surge on medRxiv. Compare monthly publication volume in infectious diseases, epidemiology, and public and global health from June 2019 onward. When did each field peak, and have they returned to baseline?

  3. Search for papers about “AlphaFold” or protein structure prediction across bioRxiv. What categories do they land in — bioinformatics, biophysics, biochemistry? Plot the trend since 2020.

  4. Find papers about CRISPR-based therapeutics that span both bioRxiv and medRxiv. Search across genomics, genetics, and bioengineering on bioRxiv, and genetic and genomic medicine and oncology on medRxiv.

  5. Who are the most prolific researchers in single-cell genomics on bioRxiv? Compare the top 5 side-by-side — do any of them also publish on medRxiv?

  6. Compare publication trends in neuroscience on bioRxiv and neurology on medRxiv from 2020 to present at monthly granularity. Are basic and clinical neuroscience research growing in lockstep?

  7. I’m building a meta-analysis dataset of cancer biology preprints. Find CC-BY licensed cancer biology papers on bioRxiv, count them, export a sample as CSV, and tell me which venues they end up in.

  1. Find the “Attention Is All You Need” paper and show me its 20 most-cited citing papers. Which of those citers are in the Valency corpus so I can dig deeper?

  2. I want to understand how CRISPR research spread across disciplines. Find the original Doudna & Charpentier paper, then trace its citations chronologically — which fields picked it up first?

  3. What are the most-cited papers about “graph neural networks” published since 2020? Sort by citation count, not recency — I want impact, not novelty.

  4. Take the top 3 most-cited papers in reinforcement learning from 2023. For each, find who cited them and in what categories. Are the citers mostly in cs.AI, or are other fields like robotics and biology adopting RL?

  1. I’m <researcher name>, an astrophysicist at UC Berkeley. Based on my recent papers, what are some new directions I could take my research that make use of sentence transformers in innovative ways? Look at what’s trending in related fields and find unexplored intersections with my work.

  2. Pull up the profile and recent papers of Yann LeCun. Based on his research trajectory and the current hottest trends in cs.LG and cs.AI, what emerging subfields or cross-domain topics would be natural extensions of his work?

  3. I’m a neuroscience researcher — find my recent bioRxiv papers under “Eve Marder.” What topics are gaining traction in neuroscience and adjacent fields like bioinformatics and systems biology that I haven’t touched yet? Identify gaps between my work and emerging trends.

  4. Find papers semantically similar to “using large language models for drug discovery” across both bioRxiv and arxiv. Then find the most-cited among them. What do the highest-impact papers suggest about where this cross-domain field is heading?

  1. Plot yearly publication trends from 2000 to present for cs.AI, cs.LG, cs.CL, and a keyword-filtered subset of “symbolic reasoning” or “knowledge representation.” Overlay them. I want to see whether symbolic AI is resurging post-LLM or continuing to decline.

  2. Pull the author profile for Fei-Fei Li. Find her top 30 collaborators. For each of the top 10, extract their primary arXiv category distribution. I want to quantify: how many collaborators are primarily in cs.CV vs cs.AI vs medical domains (e.g., stat.ML, q-bio). Identify any individuals who act as bridges between core AI and medical imaging.

  3. Search for papers mentioning “cold fusion” or “low energy nuclear reactions” in physics categories. Plot yearly publication trends from 1989 to present. Then find the 10 most cited similar papers and inspect their revision histories. Did this field collapse or stabilize?

  4. For cs.AI, identify the 25 most prolific authors. For each, compute the entropy of their category distribution (how spread out they publish). Who are the most interdisciplinary vs most specialized AI researchers?

  5. Create a force-directed graph showing the people working on topics covered in the papers of Cambridge prof Austen Lamacraft. Repeat for Josh Bloom, the astrophysicist at UC Berkeley.