Download presentation
Presentation is loading. Please wait.
1
5-star Ratings & Recommendations with Mahout
Robin Bramley Chief Scientific Officer Ixxus
2
We are a leading global provider of end-to-end custom-built content solutions.
3
Our Alfresco Credentials
Long-standing Platinum Alfresco partner in US and UK Working with Alfresco since Alfresco v0.6 Excellent Alfresco knowledge and highly trained and experienced staff We are trusted to deliver some of the largest Alfresco projects in the World Alfresco Million $ Club (May 2012) Best Solution Partner (Nov 2013)
4
Award-winning projects:
Contributed to:
5
Presented at: Published in:
7
Discovering existing knowledge
How did we find answers 30 years ago? How was that information organised? Encyclopædia Library Bookshop Printed 7
8
The landscape changed "Updating dozens of books every two years now seems so pedestrian. The younger generation consumes data differently now, and we want to be there.” Jorge Cauz, Britannica, 2012
9
Number 6: “What do you want?” Number 2: “We want information.”
The Prisoner
10
Discoverability Metadata is key
Permits discovery through multiple dimensions
11
Finding stuff in Alfresco
A quick recap
12
Wordle: Browse Keyword~search Advanced~search Faceted~navigation Workflow Taxonomy Folksonomy~tags Dashlets Image~browsing Association~relationships Favourites Likes
13
Wanted to use the Anthrax Anti-Social single cover here – copyright stopped play
Audience participation exercise
14
Social content
15
Alice and Barbara I love my new iPhone 6 Me too!
Alfresco on iOS is great isn’t it? If you like Alfresco you should check out Robin’s Summit talk… Recommendations in a nutshell
16
Collaborative filtering
User similarity recommendations in a nutshell A B C 1 2 3 4 5
17
Alfresco 5-star ratings
5 star rating scheme supported by the Ratings Service Not exposed in Share Nod to metaversant / Jeff Potts’ 5 star Share extension
18
Demo time
19
Overview Diagram needs to be made clearer for projection
20
Technical details UML class diagram here?
21
5 stars give us preference level
Taste
22
The elephant in the room
23
Hadoop Hadoop was named after a stuffed toy elephant owned by the son of Doug Cutting who started the project Hadoop was extracted from the Nutch crawler Lucene sub-project and provides a scalable batch data processing framework using Map-Reduce on top of a distributed file system (HDFS). The use of Hadoop is beyond the scope of this session
24
Mahout started off as a sub-project of Apache Lucene
Portions of Mahout were* built on top of Hadoop The name is a Hindi word referring to an elephant driver * the project is moving over to Apache Spark
25
Recommendations Clustering Classification User or item similarity
Grouping similar documents Classification Reduce manual burden of assigning categories
26
RDBMS data source
28
Back to the demo
29
Overview Diagram needs to be made clearer for projection
30
Technical details
31
Sample Code { // extract avm store id and path
var fullpath = url.extension.split("/"); if (fullpath.length == 0) status.code = 400; status.message = "Store id has not been provided."; status.redirect = true; break script; } var storeid = fullpath[0]; var path = (fullpath.length == 1 ? "/" : "/" + fullpath.slice(1).join("/"));
32
Questions
33
Image credits Land Rover Discovery 3 Encylopædia Dewey Decimal
Encylopædia Dewey Decimal Book store Anti-social sign
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.