How to Publish a Research Paper in Data Science

A practical guide to publishing research in data science

A practical, field-specific guide to publishing research in data science, covering norms, challenges, the step-by-step process, where to publish, and how to choose between journal types.

Frequently Asked Questions

Do I need to release my dataset to publish in data science?

Increasingly yes. If legal or ethical constraints prevent release, a documented sample or synthetic dataset is usually expected, along with a clear justification.

How important is statistical significance in data science papers?

Very important for confirmatory work. Reviewers expect appropriate tests, multiple-run reporting, and discussion of effect size, not only p-values.

Can I publish replication studies in data science?

Yes. Replications and negative results are increasingly welcome, particularly in venues with reproducibility tracks and in structured low-cost journals.

What is the difference between a data science journal and an ML conference?

Conferences emphasise novelty and short review cycles; journals provide deeper review, archival permanence, and space for extended methodology and ablations.

Are notebooks acceptable as supplementary material?

Yes, when accompanied by a runnable environment specification such as a requirements file, conda environment, or Docker image.

Read this on EP Journals