Download presentation
Presentation is loading. Please wait.
1
Knowledge Architecture in the Enterprise 2.0 Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services http://www.kapsgroup.com
2
2 Agenda Introduction Web 2.0: Folksonomies Search, Content Management, Text Analytics Conclusion
3
3 Folksonomies – Wikipedia Definition Folksonomy (also known as collaborative tagging, social classification, social indexing, and social tagging) is the practice and method of collaboratively creating and managing tags to annotate and categorize content. Folksonomy describes the bottom-up classification systems that emerge from social tagging. [1] In contrast to traditional subject indexing, metadata is generated not only by experts but also by creators and consumers of the content. Usually, freely chosen keywords are used instead of a controlled vocabulary. [2] Folksonomy (from folk + taxonomy) is a user generated taxonomy.collaborativelytagscategorize content [1]subject indexingkeywordscontrolled vocabulary [2] taxonomyuser generated
4
4 Web 2.0 – No need for Taxonomies etc.? “ Tags are great because you throw caution to the wind, forget about whittling down everything into a distinct set of categories and instead let folks loose categorizing their own stuff on their own terms." - Matt Haughey - MetaFilter Tyranny of the majority - worst type of central authority More Madness of Crowds than Wisdom of Crowds “Things fall apart; the center cannot hold; Mere anarchy is loosed upon the world,… The best lack all conviction, while the worst Are full of passionate conviction.” - The Second Coming – W.B. Yeats
5
5 Advantages of Folksonomies Simple (no complex structure to learn) – No need to learn difficult formal classification system Lower cost of categorization – Distributes cost of tagging over large population Open ended – can respond quickly to changes Relevance – User’s own terms Support serendipitous form of browsing Easy to tag any object – photo, document, bookmark Better than no tags at all Getting people excited about metadata!
6
6 Folksonomies – Problems and Limits Folksonomies don’t compare with taxonomies or ontologies Serendipity browsing is small part of search Limited areas of success – popular sites are popular Quality Content – finance, science, etc – not good candidates No mechanism for improving folksonomies Scale – Too Big (million hits) – Too Little (200 items) – Amazon and LibraryThing Need intrinsic value of tagging – not tagging for better tags Bad Tags - idiosyncratic or too broad, errors, limited reach – Most people can’t tag very well – learned skill
7
7 Del.icio.us Tags Design blog software music tools reference art video programming webdesign web2.0 mac howto linux tutorial web free news photography shopping blogs css imported education travel javascript food games Development inspiration politics flash apple tips java google osx business windows iphone science productivity books toread helath funny internet wordpress ajax ruby research humor fun technology search opensource Photoshop media recipes cool work article marketing security mobile jobs rails lifehacks tutorials resources php social download diy ubuntu freeware portfolio photo movies writing graphics youtube audio online
8
8 Del.icio.us - Folksonomy Findability Too many hits (where have we heard that before?) – Design – 1 Mil, software – 931,259, sex – 129,468 No plurals, stemming (singular preferred) – Folksonomy – 14,073, folksonomies – 3,843, both – 1,891 – Blog-1.7M, blogs – 516,340, Weblog- 155,917, weblogs – 36,434, blogging – 157,922, bloging – 697 – Taxonomy – 9.683, taxonomies – 1,574 Personal tags – cool, fun, funny, etc – Good for social research, not finding documents or sites – How good for personal use? Funny is time dependent
9
9 Library Thing Book people aren’t much better at tagging High level concepts – psychology (55,000), religion (120,000), science (101,000) Issue – variety of terms – cognitive science – need at least 40 other tags to cover the actual field of cognitive science Strange tags – book (19,000) – it’s a book site? Combination of facets and topics – Facets – Date (16 th century, 1950’s, 2007) // Function (owned, not read) // Type (graphic novel, novel) // Genre (horror, mystery) – Topics – majority like Del.icio.us
10
10 Library Thing – Book on Neuroscience 1) (Location: dining room)(1) biological(1) biology(8) box74(1) Brain(1) brain research(1) brains(1) cognitive neuroscience(1) cognitive science(1) consciousness(1) currently reading(1) HelixHealth(1) kognitionswissenschaft(1) medical(1) medicine(1) neuroscience(19) non-fiction(5) partread(1) Psychology(4) Science(10) textbook(10) theory(1)(Location: dining room)biologicalbiology box74Brainbrain researchbrainscognitive neurosciencecognitive scienceconsciousness currently readingHelixHealth kognitionswissenschaftmedicalmedicine neurosciencenon-fictionpartreadPsychology Sciencetextbooktheory Too General: Science, Psychology, biology, textbook Too specific: Location: dining room, box74 Facets: currently reading, partread
11
11 Better Folksonomies: Will social networking make tags better? Not so far – example of Del.icio.us – same tags Quality and Popularity are very different things Most people don’t tag, don’t re-tag Study – folksonomies follow NISO guidelines – nouns, etc – but do they actually work – see analysis Most tags deal with computers and are created by people that love to do this stuff – not regular users and infrequent users – Beware true believers!
12
12 Knowledge Architecture : Social Networking KM 2.0 (or 3.0?) – KM always concerned with social aspects of knowledge New relationship of center and users – more sophisticated support, more freedom, more suggestions, more user input – - New roles – for users (taggers, part of variety of communities – both distributed and central) – New roles for central – create feedback system, tweak the evolution of the system, Develop initial candidates Communities of Practice – apply to tagging, ranking – Community Maps – formal and informal – Map tags to communities – more useful suggestions – Use tags to uncover communities (see tech SNA)
13
13 Knowledge Architecture: Technology for Web 2.0 Enterprise Content Management – Place to add metadata – of all kinds, not just keywords – Policy support – important, part of job performance – Add tag clouds to input page – More sophisticated displays Tag clouds mapped to community map Tag clusters, taxonomy location Semantic Software – Inxight, Teragram etc. – Suggest terms based on text, on tag clouds Social Networking – add semantics – SNA – apply to people and tags Wiki – more powerful than blogs, more work to set up & maintain
14
14 Knowledge Architecture: Putting it all together Complexity Theory and Folksonomies: Feedback Ranking Methods – Explicit – people rank directly Categories, tags, taggers Good tags, best bets for terms or categories? – Implicit – software evaluation, reverse relevance Ranking Roles – Taggers – everyone (rewards, make it easy and fun) – Meta-taggers – everyone (but levels of meta-taggers) – Editors – tagging system, integration with taxonomy, resolve disputes, Wikipedia model
15
15 Knowledge Architecture: Content Structures – Best of Both Worlds Start and end with a formal taxonomy / Ontology – Findability vastly superior – Communication with others – share tags – Take advantage of conceptual relationships Tagging experience – folksonomies plus – Users can type any word – system looks it up – plurals, synonyms, preferred terms, spelling variations – Software suggestions – based on content of bookmark, document and on popular user tags – natural level not top down – New terms flagged and routed to central team Facets – for both things and documents (faceted taxonomy) – Software suggests facet values, user override – Cognitively simpler task than own value, complex hierarchy
16
16 Knowledge Architecture in the Enterprise Technology Text Analytics Content Management Search – Browse – Faceted Navigation
17
17 Varieties of Taxonomy/ Text Analytics Software Taxonomy Management Text Analytics – Auto-Categorization, Entity Extraction – Sentiment Analysis Software Platforms – Content Management, Search Application Specific – Business Intelligence
18
Vendors of Taxonomy/ Text Analytics Software Attensity Business Objects – Inxight Clarabridge ClearForest Data Harmony / Access Innovations Lexalytics Multi-Tes Nstein SchemaLogic Synaptica Teragram Wikionomy Wordmap Lots More 18
19
19 Why Taxonomy Software? If you have to ask, you can’t afford it Spreadsheets – Good for calculations, days of taxonomy development over – (almost) Ease of use – more productive – Increase speed of taxonomy development – Better Quality – synonyms, related terms, etc. Distributed development – lower cost, user input (good and bad)
20
20 Text Analytics Software – Features Entity Extraction – Multiple types, custom classes Auto-categorization – Taxonomy Structure – Training sets – Bayesian, Vector space – Terms – literal strings, stemming, dictionary of related terms – Rules – simple – position in text (Title, body, url) – Boolean– Full search syntax – AND, OR, NOT – Advanced – NEAR (#), PARAGRAPH, SENTENCE Advanced Features – Facts / ontologies /Semantic Web – RDF + – Sentiment Analysis
21
21 Current State of Content Management Content Management – Strong on management, weak on content – Content is a black box – simply moved around What is missing is the meaning dimension – In-depth and articulated understanding of content Perceived Solution – Delphi Survey – Taxonomy – 90% plan on taxonomy strategy in 24 months – 76% taxonomy is important Text Analytics integrated into CM - essential
22
22 Taxonomic Content Management: Work Flow with Meaning Preliminary Foundation Work – Design the ontology – Develop taxonomies – Design metadata standards – Collaborative development of controlled vocabularies Authors, SME’s – check document in: – Have a summary either written by human or software – List of metadata suggestions, entities – people, places, etc. – Provisional categorization – Decision: publish or submit for review, central team or community of experts. – Request for additional keywords or categorization issues
23
23 Taxonomic Content Management: Work Flow with Meaning Central Team – Review documents – easier, faster – Use summaries, metadata, entities to provide context – Review infrastructure requests – new keywords, categories Integrated Work Flow – Strengths of local and central – Variety of roles, flexible (few dedicated roles needed). – Collaborative categorization and keywords by SME, software, and central team SME’s can function as central team
24
24 Semantics and Search: An Integrated Approach: Elements Multiple Knowledge Structures – Facet – orthogonal dimension of metadata – Taxonomy - Subject matter / aboutness – Ontology – Relationships / Facts – Subject – Verb - Object Software - Text analytics, auto-categorization, entity extraction People – tagging, evaluating tags, fine tune rules and taxonomy People – Users, social tagging, suggestions
25
25 Faceted Navigation: Strengths and Weaknesses Strengths: – More intuitive – easy to guess what is behind each door 20 questions – we know and use – Dynamic selection of categories Allow multiple perspectives – Trick Users into “using” Advanced Search wine where color = red, price = x-y, etc.. Weaknesses: – Difficulty of expressing complex relationships Simplicity of internal organization – Loss of Browse Context Difficult to grasp scope and relationships – Limited Domain Applicability – type and size Entities not concepts, documents, web sites
26
26
27
27
28
28
29
29
30
30
31
31 Integrated Design – Facets & Semantics Design Issues - General What is the right combination of elements? – Faceted navigation, metadata, browse, search, categorized search results, file plan What is the right balance of elements? – Dominant dimension or equal facets Full Facets – Multiple intersecting filters – 1 or 2 filters (source / type) – No When to combine search, topics, and facets? – Search first and then filter by topics / facet – Browse/facet front end with a search box
32
32 Integrated Design – Facets & Semantics Design Issues - General Good Information Architecture – Space wars – summary or full facet display – Simplicity vs. research power – Source and Type are basics – Standard Facets – People, Companies, Place, Industry – Interactive interface – sliders, date ranges Semantics still hardest – summaries, related, rank Taxonomy – just another facet? – Keywords vs. simple taxonomy Tag Clouds / Clusters – how useful? Feedback – numbers of stories vs. top stories
33
33 Conclusion Web 2.0 is not the answer. The answer is: A knowledge architecture that integrates web 2.0 with: – Knowledge structures like taxonomies, facets, and basic level categories – Text analytics – categorization, entity extraction – Content Management with meaning – KA team that works with distributed groups in a variety of ways – A smart search that builds on a Knowledge Architecture that includes taxonomies, faceted metadata, best bets and search logs
34
34 Resources Information Today – Enterprise Search Sourcebook – 2008 Knowledge Architecture Approach to Search – Tom Reamy – Enterprise Search Sourcebook – 2009 CMS Watch – Content Management – www.cmswatch.com www.cmswatch.com Text Analytics Software – Information Today – www.infotoday.comwww.infotoday.com Web Sites – Taxonomy Community of Practice: http://finance.groups.yahoo.com/group/TaxoCoP/ http://finance.groups.yahoo.com/group/TaxoCoP/
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.