The website is in Maintenance mode. We are in the process of adding more features.
Any new bookmarks, comments, or user profiles made during this time will not be saved.

Machine Learning Resources

What is Data Leakage?

Bookmark this question

Data leakage occurs when information outside the scope of the training data is used in the model building process. This can induce unintended and unknown bias into the model that might not be discovered until it is not performing as intended when put into production. The best way to safeguard against data leakage is to have a robust validation procedure that ensures no portion of the validation data is used anywhere during the training process. 

Leave your Comments and Suggestions below:

Please Login or Sign Up to leave a comment

Partner Ad  

Find out all the ways
that you can

Explore Questions by Topics

Partner Ad

Learn Data Science with Travis - your AI-powered tutor |