Question 1

Do I need to release my dataset to publish in data science?

Accepted Answer

Increasingly, yes. Where legal or ethical constraints prevent release, reviewers usually expect a documented sample or a synthetic dataset, together with a clear justification for withholding the full data.
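As one illustration of what a "documented sample" could look like, the sketch below draws a seeded random subset and records a small manifest (seed, row counts, checksum) so readers can verify they hold the same file. The function name `draw_documented_sample` and the manifest fields are assumptions made for this example, not a standard required by any venue, and sampling by itself does not resolve legal or ethical constraints on the underlying records.

```python
import hashlib
import json
import random


def draw_documented_sample(rows, sample_size, seed=0):
    """Draw a reproducible random sample and describe it in a manifest.

    `rows` is assumed to be a list of JSON-serialisable records (e.g. dicts).
    """
    if sample_size > len(rows):
        raise ValueError("sample_size cannot exceed the number of rows")

    # A dedicated, seeded generator keeps the draw reproducible.
    rng = random.Random(seed)
    indices = sorted(rng.sample(range(len(rows)), sample_size))
    sample = [rows[i] for i in indices]

    # Checksum of a canonical serialisation, so others can verify the sample.
    payload = json.dumps(sample, sort_keys=True).encode("utf-8")
    manifest = {
        "source_rows": len(rows),
        "sample_rows": len(sample),
        "seed": seed,
        "sha256": hashlib.sha256(payload).hexdigest(),
    }
    return sample, manifest
```

The manifest can be shipped next to the sample file and quoted in the paper's data availability statement.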

Question 2

How important is statistical significance in data science papers?

Accepted Answer

Very important for confirmatory work. Reviewers expect appropriate statistical tests, results reported over multiple runs, and a discussion of effect size, not only p-values.
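A minimal sketch of reporting a comparison across several runs, using only the Python standard library. It assumes two methods were evaluated on the same seeds (paired scores) and uses an exact sign-flip permutation test with a paired Cohen's d; these are choices made for the example, and the right test depends on the study design. The function name `paired_summary` is hypothetical.

```python
import itertools
import statistics


def paired_summary(scores_a, scores_b):
    """Summarise paired per-run scores for two methods.

    Returns the mean difference, a paired effect size (mean difference
    divided by the standard deviation of the differences), and a two-sided
    p-value from an exact sign-flip permutation test.
    """
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two equal-length lists with at least two runs")

    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)
    effect_size = mean_diff / sd_diff if sd_diff > 0 else float("nan")

    # Under the null hypothesis the sign of each paired difference is
    # arbitrary, so enumerate every sign assignment (2**n of them) and count
    # how often the total is at least as extreme as the one observed.
    observed = abs(sum(diffs))
    extreme = 0
    total = 0
    for signs in itertools.product((1, -1), repeat=len(diffs)):
        total += 1
        flipped = abs(sum(s * d for s, d in zip(signs, diffs)))
        if flipped >= observed - 1e-12:
            extreme += 1

    return {
        "n_runs": len(diffs),
        "mean_diff": mean_diff,
        "effect_size": effect_size,
        "p_value": extreme / total,
    }
```

With n runs the smallest two-sided p-value this test can produce is 2 / 2**n, which is one concrete reason a handful of seeds rarely supports a strong significance claim. Full enumeration is only practical for small n; larger studies would sample permutations instead.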

Question 3

Can I publish replication studies in data science?

Accepted Answer

Yes. Replications and negative results are increasingly welcome, particularly in venues with dedicated reproducibility tracks and in low-cost journals with a structured review process.

Question 4

What is the difference between a data science journal and an ML conference?

Accepted Answer

Conferences emphasise novelty and short review cycles; journals provide deeper review, archival permanence, and space for extended methodology and ablations.

Question 5

Are notebooks acceptable as supplementary material?

Accepted Answer

Yes, when accompanied by a runnable environment specification such as a requirements file, conda environment, or Docker image.
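One way to produce the requirements file is to pin the exact versions installed in the environment where the notebook last ran. The sketch below does this with the standard library's `importlib.metadata`; the helper name `pinned_requirements` and the package names in the usage note are placeholders chosen for illustration.

```python
from importlib import metadata


def pinned_requirements(packages):
    """Return sorted 'name==version' lines for the named installed packages."""
    lines = []
    for name in packages:
        try:
            version = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Fail loudly rather than ship an incomplete specification.
            raise RuntimeError(f"{name} is not installed in this environment")
        lines.append(f"{name}=={version}")
    return sorted(lines)


def write_requirements(packages, path="requirements.txt"):
    """Write the pinned lines to `path`, one requirement per line."""
    lines = pinned_requirements(packages)
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(lines) + "\n")
    return lines
```

For example, `write_requirements(["numpy", "pandas"])` would write one pinned line per package, provided both are installed. Listing only the packages the notebook imports directly keeps the file readable, at the cost of leaving transitive dependencies unpinned.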

How to Publish a Research Paper in Data Science
