Data Mining on Food Recipes Kyung-Joong Kim March
Cooking Book
Nowadays …
Recipe Analysis
Idea: Recipes on the Web → Recipes Database → Data Mining → Knowledge
Inside of HTML
Semi-Automatic Approach (with human experts)
New Approach: Semi-Structured → Structured
Recipe Types
Missing Data (1)
Missing Data (2)
Measures
Alternatives                 Representative
teaspoons, tsp, ts, t        teaspoon
tablespoons, tbs, tb, T      tablespoon
pounds, lbs, lb              pound
cups, c                      cup
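The mapping from alternative spellings to one representative measure can be sketched as a lookup table. This is a minimal illustration, not the conversion code used in the study: the names `UNIT_ALIASES`, `CASE_SENSITIVE`, and `normalize_unit` are hypothetical, and the handling of trailing periods is an added assumption. One detail taken from the slide is that the single-letter forms are case-sensitive ("t" is teaspoon, "T" is tablespoon), so they cannot simply be lowercased.

```python
# Alternatives and representatives as listed on the "Measures" slide.
UNIT_ALIASES = {
    "teaspoon": ["teaspoons", "tsp", "ts", "t", "teaspoon"],
    "tablespoon": ["tablespoons", "tbs", "tb", "T", "tablespoon"],
    "pound": ["pounds", "lbs", "lb", "pound"],
    "cup": ["cups", "c", "cup"],
}

# Single-letter forms must be matched before lowercasing:
# "t" means teaspoon while "T" means tablespoon.
CASE_SENSITIVE = {"t": "teaspoon", "T": "tablespoon"}

# Case-insensitive lookup for every other alias.
LOOKUP = {
    alias.lower(): representative
    for representative, aliases in UNIT_ALIASES.items()
    for alias in aliases
    if alias not in CASE_SENSITIVE
}


def normalize_unit(token):
    """Return the representative measure for a token, or None if unknown."""
    token = token.strip().rstrip(".")  # assumption: "tsp." should match "tsp"
    if token in CASE_SENSITIVE:
        return CASE_SENSITIVE[token]
    return LOOKUP.get(token.lower())


if __name__ == "__main__":
    for raw in ["T", "t", "Tbs", "lbs", "Cups", "pinch"]:
        print(raw, "->", normalize_unit(raw))
```

Tokens that are not in the table return `None`, so unknown measures can be flagged for review rather than silently guessed.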
Ingredients
Cuts of Beef
Deletion of Words
– Small, Baby, Short, Thin, …
– Ground, Chopped, Crushed, Dried, Soaked, Cooked, …
– Fresh, Hot, Cold, …
– Round, …
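Removing descriptor words so that variants of an ingredient collapse to one name can be sketched as a simple word filter. This is an illustrative sketch only: `STOP_WORDS` holds just the words visible on the slide (the "…" entries mean the real list is longer), and the function name `strip_descriptors` and the letters-only tokenization are assumptions.

```python
import re

# Descriptor words shown on the slide; the actual list is longer.
STOP_WORDS = {
    "small", "baby", "short", "thin",
    "ground", "chopped", "crushed", "dried", "soaked", "cooked",
    "fresh", "hot", "cold",
    "round",
}


def strip_descriptors(name):
    """Lowercase an ingredient name and drop the descriptor words."""
    words = re.findall(r"[A-Za-z]+", name)  # assumption: alphabetic tokens only
    kept = [w.lower() for w in words if w.lower() not in STOP_WORDS]
    return " ".join(kept)


if __name__ == "__main__":
    for raw in ["Fresh chopped garlic", "Dried red chili", "soy sauce"]:
        print(repr(raw), "->", repr(strip_descriptors(raw)))
```

A plain word filter like this can over-delete (for example, a name made up entirely of listed words becomes empty), so the results are worth checking by hand.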
Experiments
Korean (104 recipes)
– 64 MasterCook
– 40 Meal-Master
Converting Results (Semi-Structured → Structured)
– 99 successful conversions
– 5 failures
Why Failure? (1) No information (2 Cases)
Why Failure? (2) Unusual structure (3 Cases)
Ingredients Q) How many ingredients? A) 186 ingredients
Ingredients Per Recipe ± ingredients Max : 22 (Northern Cabbage Kimchi)
Spice A culinary term (not a botanical category) Plant products used in flavoring food and beverages
Spices: onion, pepper, garlic, ginger, chili, paprika, mustard, cinnamon, parsley, sesame, scallion, vinegar, soy sauce
Spice Analysis
# of Spices in Ingredients: 46 (24.7%)
# of Spices in Recipe: ±
Recipes with Spices: 93 (89%)
Recipes without Spices: 11 (11%)
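The counts on this slide (share of distinct ingredients that are spices, spices per recipe, recipes with and without spices) can be computed along the following lines. This is a sketch under stated assumptions: `SPICES` contains only the 13 names shown on the "Spices" slide, not the full set of 46, the function name `spice_stats` and the toy recipes are hypothetical, and the toy data is not taken from the study. The 24.7% on the slide corresponds to 46 of the 186 distinct ingredients.

```python
from statistics import mean, stdev

# Partial spice list: only the names visible on the "Spices" slide.
SPICES = {
    "onion", "pepper", "garlic", "ginger", "chili", "paprika", "mustard",
    "cinnamon", "parsley", "sesame", "scallion", "vinegar", "soy sauce",
}


def spice_stats(recipes):
    """Summarize spice usage.

    recipes: dict mapping recipe name -> list of normalized ingredient names.
    """
    if not recipes:
        raise ValueError("at least one recipe is required")
    all_ingredients = set().union(*recipes.values())
    spice_ingredients = all_ingredients & SPICES
    # Number of distinct spices used in each recipe.
    per_recipe = [len(set(ings) & SPICES) for ings in recipes.values()]
    with_spices = sum(1 for n in per_recipe if n > 0)
    return {
        "n_ingredients": len(all_ingredients),
        "n_spice_ingredients": len(spice_ingredients),
        "spice_share": len(spice_ingredients) / len(all_ingredients),
        "mean_spices_per_recipe": mean(per_recipe),
        "std_spices_per_recipe": stdev(per_recipe) if len(per_recipe) > 1 else 0.0,
        "recipes_with_spices": with_spices,
        "recipes_without_spices": len(per_recipe) - with_spices,
    }


if __name__ == "__main__":
    # Made-up toy data for illustration only.
    toy = {
        "toy kimchi": ["cabbage", "garlic", "ginger", "chili"],
        "toy bulgogi": ["beef", "soy sauce", "garlic", "sesame", "sugar"],
        "toy rice": ["rice", "water"],
    }
    for key, value in spice_stats(toy).items():
        print(key, value)
```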
# of Recipes (Only Spices)
Meat vs. Vegetable Recipes
                    Total       Vegetable Recipes    Meat Recipes
# of Recipes        104         60 (57.7%)           44 (42.3%)
# of Ingredients    8.356 ±     ±                    ±
# of Spices         4.365 ±     ±                    ± 1.753
Conclusions
– Automatic Analysis of Semi-Structured Recipes on the Web
– Insights from Data-Driven Analysis for Food
Future Work
– Apply the Analysis to Other Countries (Comparative Study)
– Apply Data Mining Algorithms to the Data