How to Publish a Research Paper in Data Science
A practical guide to publishing research in data science
A practical, field-specific guide to publishing research in data science, covering norms, challenges, the step-by-step process, where to publish, and how to choose between journal types.
Frequently Asked Questions
Do I need to release my dataset to publish in data science?
Increasingly yes. If legal or ethical constraints prevent release, a documented sample or synthetic dataset is usually expected, along with a clear justification.
How important is statistical significance in data science papers?
Very important for confirmatory work. Reviewers expect appropriate tests, multiple-run reporting, and discussion of effect size, not only p-values.
Can I publish replication studies in data science?
Yes. Replications and negative results are increasingly welcome, particularly in venues with reproducibility tracks and in structured low-cost journals.
What is the difference between a data science journal and an ML conference?
Conferences emphasise novelty and short review cycles; journals provide deeper review, archival permanence, and space for extended methodology and ablations.
Are notebooks acceptable as supplementary material?
Yes, when accompanied by a runnable environment specification such as a requirements file, conda environment, or Docker image.