Discovery & data audit
Understand what you have and where you want to go. We audit data quality, frame the problem, and plan experiments.
- Problem scoping & success metrics
- Data quality assessment
- Technical roadmap & experiment design
R&D lab for hire. Start with an idea, a dataset, or a prototype. We'll deliver it ready to ship, publish, or present—in weeks, not months.
Rapid exploration, prototype baselines, and hard-nosed feasibility checks. See our work.
Prompting, rubric design, and reliability testing before production. Learn about our LLM chatbot work.
Offline/online metrics, ranking experiments, and user-impact analysis. Read our search engine case study.
We clean messy data, find meaningful patterns, and test hypotheses. Explore our small datasets research.
We build models that predict trends and alert you to unusual data points.
Clear documentation, evaluation reports, and deployment-ready guidance.
We run 4–8 week sprints. You get shipped code, research findings, or go-to-market demos. Not reports about what might work someday.
Understand what you have and where you want to go. We audit data quality, frame the problem, and plan experiments.
Build it. We implement baselines, test approaches, and run evaluations to prove what works.
Deliver it ready to use. Production code, documentation, monitoring, and team training.
Sergey Feldman is a Principal Research Scientist at the Allen Institute for AI (Ai2) and Head of AI at Alongside. He leads work on production-scale AI systems, including core infrastructure for Semantic Scholar and Asta. His expertise spans LLMs, machine learning, natural language processing, and scientific document analysis.
Sergey founded Data Cowboys to bring research-grade ML to organizations solving hard problems. Sergey holds a PhD in machine learning from the University of Washington and has been building AI systems since 2007.
Ilya Barshai is a Principal Research Scientist at McGraw Hill, where he applies AI and machine learning to educational technology. He brings over a decade of hands-on data science experience.
Ilya joined Data Cowboys in 2016, bringing a practitioner's approach to ML consulting focused on building solutions that actually ship. Before moving into data science, he spent eight years in risk and failure analysis of electromechanical product designs.
Sergey and Ilya grasped what we do the fastest of anyone I’ve explained our startup to. They are professional, timely, and so fast with their thinking and output.
Sergey has a wonderful knack for quickly scoping a problem, making efficient decisions on model choice and parameters, and clearly communicating throughout the process.
Sergey Feldman is brilliant, collaborative, candid, and lightning-fast. He has a special gift for explaining complex analytical concepts in terms anyone can understand.
Sergey provides well-considered results quickly with in-depth knowledge communicated clearly to readers at every level.
Data Cowboys’ work has been incredible. We understand our data, verify our impact, and keep people safe from exploitation.
We tested 108 datasets to see what really works with 100–1,000 samples.
Why models that perform identically in training can behave wildly in the wild.
How we improved Semantic Scholar search using three years of logs.
Making ML results understandable to collaborators.
Interview on moving ML systems from research to production.
Discussion on machine learning for search, consulting for Gates Foundation, AI for mental health, and career lessons.
Predicting child mortality with the CHAIN network using XGBoost and SHAP values to identify key physiological drivers for targeted interventions.
PyData Miami 2019 talk on double cross-validation for model selection and performance estimation on small datasets.
Let’s figure out what’s possible before you invest in full buildout.
Prefer email… Reach us at ilya@data-cowboys.com.