Presentation is loading. Please wait.

Presentation is loading. Please wait.

Collaborative Filtering and Rules for Music Object Rating and Selection Sifter Project Meeting Michelle Anderson Marcel Ball Harold Boley Nancy Howse Daniel.

Similar presentations


Presentation on theme: "Collaborative Filtering and Rules for Music Object Rating and Selection Sifter Project Meeting Michelle Anderson Marcel Ball Harold Boley Nancy Howse Daniel."— Presentation transcript:

1 Collaborative Filtering and Rules for Music Object Rating and Selection Sifter Project Meeting Michelle Anderson Marcel Ball Harold Boley Nancy Howse Daniel Lemire NRC, IIT Fredericton, NB, Canada June 19th, 2003 (Revised June 18 th )

2 How to implement industry standards for existing Sifter subprojects: RALOCA, COFI Music? Currently, several industry standards are in place to facilitate the description, search, storage, etc of Learning Objects.* An LO can be expressed as an entity with content surrounded by an outer shell of descriptive tags (metadata). * Learning Objects can be composed of multimedia content (images, video, sound), instructional content, learning objectives, or a combination of these different formats.

3 Learning Objects: Metadata components* LO General Meta Metadata Technical Life Cycle Educational Rights Relation Annotation Classification *Based on the SCORM Meta-data Information Model

4 Where do RALOCA and COFI Music come in? If these systems (or combined together to form one entity) can interpret relevant meta information about an LO based on current standards to provide interoperability these LOs can be sifted, weighted or compared. RALOCA / COFI Music -SCORM -CanCore -RSS-LOM -IMS LO repository

5 COFI Music by Nancy E. Howse

6 Collaborative Filtering Systems (COFI) Collects ratings from a number of users Recommends items to user based on correlations between ratings of current user and other users in database

7 Multi-Dimensional Ratings

8 Some Algorithms Average – O(1) Per Item Average – O(1) Pearson – O(m) Where ‘m’ is the number of users.

9 Some Admin Features… Add/remove items Remove users View a list of users and the number of items they have rated View the ratings of a user

10 RALOCA Rule-Applying Learning-Object Comparison Agent Marcel A. Ball National Research Council Institute for Information Technology e-Business

11 RALOCA RALOCA is a rule-based system for multi - dimensional comparison of learning objects (currently, music albums) based on jDREW Bottom-up (BU) with data represented in Object-Oriented RuleML. Part of Sifter Mosaic/NRC e-Learning project

12 The functionality of RALOCA COFI provides RALOCA with a table of predictions (summarized ratings) RALOCA uses a rule-based approach to combine the multi-dimensional predictions from COFI into a one-dimensional ranking of the items (objects)

13 RALOCA Architecture

14 Interfacing with COFI Music RALOCA builds on top of collaborative filtering technology from the COFI Music project (Nancy Howse) for ratings of the LOs Currently data is exchanged between RALOCA and COFI Music using Java serialization Currently has code in place to use the per item average algorithm We will use more advanced collaborative filtering algorithms, which will lead to better predictions

15 LO RuleML Representation product B00004YTYO Between the Bridges Sloan 15.99 6 8 6...

16 Modification Rules Modification rules allow the system to dynamically change values for the dimensions of a LO, based on information about the LOs, and the user profile. Example: There is a 5% discount for students buying roducts costing over $20.00. The modify relation has four roles: - amount- in our example this is '%-5' - variable- we want to change the 'cost' - product- a variable that will hold the asin of the LO - comment

17 modify %-5 5% discount for students cost ASIN isstudent yes product ASIN COST $gt COST 20 true Is the user a student?Is the cost greater-than 20? Retrieve asin and cost of the LO

18 XML Representation of n-Dimensional Object Ratings Ratings of (music, film, …) object will be on a scale from 0 to 10 COFI’s n-dimensional ratings of a given music object with some asin code can be represented in OO RuleML as ‘complex term’ (cterm) elements: –A rate value v becomes marked up as v. –Each v-rated dimension d becomes v. There is one rating cterm for every music object and for every rater: One row from the COFI prediction table

19 Two Sample Ratings For example, for object asinXYZ let us illutstrate 3 dimensions “lyrics”, “originality”, “performance”, as rated by 2 raters: rating 3 8 6 rating Cterm lyrics performance originality 3 8 6

20 rating 7 8 4 rating Cterm lyrics performance originality 7 4 8

21 COFI: Generating a Summarized Rating These ratings can act as a ‘training set’ of typical instances and a weighted representation can be inferred, e.g. using data-mining / collaborating filtering techniques: in this example just the arithmetic means, using the standard deviations to determine the significance (w) of the ratings. rating 5 8 5 The weights, w, on a scale from 0.0 to 1.0, reflect the raters’ agreements in each of the dimensions (weights add up to 1.0).

22 rating 6 2.4 rating 8 3.8 rating 3 1.6... Ranking by Standard Deviation product B00004YTYO Between the Bridges... 13 Traditional BMG

23 RALOCA: Retrieval Patterns A retrieval pattern can now be used to find a subset of ranked instances from (a ‘test set’ of ) many instances, based on the summarized rating. For example a user might specify desired (minimum) “lyrics”, “originality”, and “performance” ratings along with their weights. rating 8 7 6

24 XSLT: OO RuleML to “Song Rating XML” We can use XSLT to transform generic OO RuleML into a domain specific positional format (“Song Rating XML”) for rating of music objects rating 6 9 6 6 9 6

25 OO RuleML to Positional RuleMLTranslators XSLT Transformations 3 Step Process Similar to Unix pipes

26 OO RuleML Representation signature (database schema, template) implementation implementation... Apply to

27 applysig.xsl product Between the Bridges Sloan B00004YTYO 1997 13 Traditional 14.99 BMG Signature is applied to atoms with rel = product and fills in missing roles. All order is lost. product BMG … … B00004YTYO product

28 The order of the signature is applied to the atom when order = sorted. Otherwise _r’s are sorted by the n attribute nprmlsort.xsl product BMG … … B00004YTYO product … … product

29 oorml2prml.xsl product … … product product Metaroles (_r) are removed, leaving a positionalized version of each atom product B00004YTYO Between the Bridges Sloan 14.99 13 Traditional BMG

30 Relational Database Table 1: Ratings (ItemID, UserID, Dimension1, …) Table 2: Users (UserID, UserName, Password, …) Table 3: Comments (ItemID, UserID, Date, Comment, …) Table 4: Item (ItemID, Title, Author, …)

31 Free Text We will start collecting and displaying comments for two reasons: –add more content to our sites –allow further research by Anna Maclachlan and others

32 Conclusion COFI and RALOCA are specific to the music domain they describe, but they can easily be converted to describe various other e-Learning domains: movies, etc. This could be implemented to add an advanced rating / search feature to existing data-collecting systems.

33 Extra Slides

34 Learning Objects: Industry Standardization LO IEEE-LOM; provides structured Descriptions of re-usable digital Learning resources. RSS-LOM Module (translation) RSS 1.0; allows learning object repositories to syndicate listings and description of learning objects. Date LO / FEED AuthorTechnical Format X X Unique Identifier: (registry agency identifier number and time in milliseconds) RSS-LOM-Eval

35 Completing Missing Dimensions Taking the “performance” rating from the collaborative pattern, this will be expanded into the final retrieval pattern: rating 8 7 5 Possibly also the weights can be taken from the collaborative pattern (so omitting a dimension would not mean it has weight 0.0) rating 8 7 5

36 Scoring Rules The system uses a RuleML file to calculate the score of an LO. The only fixed relation within the scoring rule file is the 'score' relation, which has two arguments, one containing the 'asin' of the album (a unique identifier) and the actual score. Currently implemented as a normalized weighted sum Can be changed to implement another scoring scheme providing thatthe scheme can be calculated using the built in relations in jDREW –currently the following: addition ($add), subtraction ($sub), multiplication($mul), division ($div), summation ($sum), less-than ($lt), greater-than ($gt), square- root($sqrt).

37 RALOCA: Technologies used Object-Oriented RuleML – using the XSLT Translators written by Stephen Greene to convert Object-Oriented RuleML into positional RuleML, which can be interpreted by jDREW jDREW BU developed by Bruce Spencer modifications by Marcel Ball

38 Scale and Translation Invariant Algorithms Scale Translation User 1User 2User 3User 4User 5

39 Scale and Translation Invariant Algorithms Scale Translation User 1User 2User 3User 4User 5

40 Collaborative filtering I Correlates the current user’s ratings with those of other users. Collaborative filtering system correlate the provided ratings of the current user with the ratings of all other users of the database. to predict the current users’ ratings for unrated items.

41 Collaborative filtering II RALOCA user pre-rates 3 standard items/objects Ratings used for filtering similar raters’ ratings from COFI Music Similar means


Download ppt "Collaborative Filtering and Rules for Music Object Rating and Selection Sifter Project Meeting Michelle Anderson Marcel Ball Harold Boley Nancy Howse Daniel."

Similar presentations


Ads by Google