The Importance of Good Data (and a good data model!)
ML relies largely on datasets to build a model. If you train a model on bad data, or on data that doesn't take various nuances into consideration, you set yourself up not only for inaccurate results but also for problematic ones that misrepresent different segments of the world.
How to select data for modeling
Selecting data for modeling can appear overwhelming at first. Start by getting clear on the question you want the model to answer, since that determines which data is relevant.
Tackling your first dataset
When you approach a dataset, start by looking at all of its columns. The columns property of the DataFrame lists them.
input
import pandas as pd

# Path to the Melbourne housing snapshot CSV
melbourne_file_path = '../input/melbourne-housing-snapshot/melb_data.csv'
melbourne_data = pd.read_csv(melbourne_file_path)
melbourne_data.columns
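Beyond listing the column names, it can help to see each column's type and how many values are missing. The sketch below is one possible way to do that. It uses a small made-up DataFrame rather than the Melbourne file, so the column names, the values, and the summarize_columns helper are all hypothetical and only there for illustration.

```python
import pandas as pd

# Hypothetical stand-in data: these columns and values are invented for
# illustration and are not taken from the Melbourne housing file.
sample = pd.DataFrame({
    "Rooms": [2, 3, 4],
    "Price": [850000.0, None, 1200000.0],
    "Suburb": ["Abbotsford", "Richmond", "Carlton"],
})


def summarize_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column with its dtype and its count of missing values."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
    })


# The column names, in the order they appear in the DataFrame
column_names = list(sample.columns)

# One row per column: data type and number of missing entries
summary = summarize_columns(sample)

print(column_names)
print(summary)
```

The same helper could be called on melbourne_data once the file has been loaded. Columns with many missing values, or with a type you did not expect, are worth a closer look before you use them in a model.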