Datasets

1. Ramen Ratings

  • Dataset Versions: [raw] [cleaned]

  • Source: Kaggle

  • Description: This dataset is an export of “the Big List” (of reviews) provided on The Ramen Rater, converted to the csv format. Each record in the dataset is a single ramen product review.

  • Attributes in the clean version:

    • Brand: the brand of the ramen product.
    • Variety: the product’s name on its label.
    • Style: packaging style (cup, bowl, tray, etc.)
    • Country: country or dependent territory the product was produced in.
    • Stars: the rating of the product on a 5-point scale.
  • Attributes in the raw version:

    • 'Review #': the order in which the ramen was reviewed.
    • 'Brand': the brand of the ramen product.
    • 'Variety': the product’s name on its label.
    • 'Style': packaging style (cup, bowl, tray, etc.)
    • 'Country': country or dependent territory the product was produced in.
    • 'Stars': the rating of the product on a 5-point scale.
    • 'Top Ten': the year and the rank if the ramen was ranked within the top 10, '' (empty string) otherwise.

2. Input Files Used in 9.1 Lecture Video/Notes

3. Green International Market Ramen

  • This is a simulated dataset created for the purpose of this class.
  • Dataset: [green-int-ramen.csv]
  • Attributes:
    • brand: the brand of the ramen product.
    • name: the product’s name on its label.
    • packaging: packaging style (cup, bowl, tray, etc.)
    • from: country or dependent territory the product was produced in.
    • sales: the average weekly sales in 2019 (these statistics are simulated).

Cheat Sheets


Software Setup