The Dataset has been picked up from Kaggle. Instructions to scrap data from WineEnthusiast can be found here https://github.com/zackthoutt/wine-deep-learning .
The data consists of 10 fields:
- Points: the number of points WineEnthusiast rated the wine on a scale of 1-100 (though they say they only post reviews for wines that score >=80)
- Title: the title of the wine review, which often contains the vintage if you're interested in extracting that feature
- Variety: the type of grapes used to make the wine (ie Pinot Noir)
- Description: a few sentences from a sommelier describing the wine's taste, smell, look, feel, etc.
- Country: the country that the wine is from
- Province: the province or state that the wine is from
- Region 1: the wine growing area in a province or state (ie Napa)
- Region 2: sometimes there are more specific regions specified within a wine growing area (ie Rutherford inside the Napa Valley), but this value can sometimes be blank
- Winery: the winery that made the wine
- Designation: the vineyard within the winery where the grapes that made the wine are from
- Price: the cost for a bottle of the wine
- Taster Name: name of the person who tasted and reviewed the wine
- Taster Twitter Handle: Twitter handle for the person who tasted ane reviewed the wine