Presentation is loading. Please wait.

Presentation is loading. Please wait.

Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST-2000-25366 Kick-off.

Similar presentations


Presentation on theme: "Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST-2000-25366 Kick-off."— Presentation transcript:

1 Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST-2000-25366 Kick-off meeting Edinburg March 2001

2 CROSSMARC Kick-off meetingICDC NLP-based applications at ICDC Documents filtering –Syntactic analysis + NERC + Inference engine –Intranet and commercial internet Documents clustering –Statistical analysis Real-time documents indexing – Search engine techniques

3 Edinburg March 2001CROSSMARC Kick-off meetingICDC NLP- based prototypes at ICDC Shareholding events detection –Information extraction Documents filtering –transducers (CORAIL) –neural networks (TREC) Control techniques using machine learning –controlling filters with neural networks (RIAO) –controlling NERC with C4.5 (ADIET with NCSR)

4 Edinburg March 2001CROSSMARC Kick-off meetingICDC NLP-based applications Complex applications –development –exploitation –maintenance Heterogeneous modules –implementation: OS, language, communication, format –processing: data, resources, algorithms

5 Edinburg March 2001CROSSMARC Kick-off meetingICDC TalLab ICDC architecture for NLP-based applications –Operational since 1997 –Used in several applications and prototypes Publications –[Wolinski et al. 98] NLP+IA, Moncton –[Wolinski et Vichot 01] TSI, Paris Reference –[Cunningham et al. 00] LREC, Athens

6 Edinburg March 2001CROSSMARC Kick-off meetingICDC Guidelines for the design of TalLab Relying on a multi-agents model Reusing the OS wherever it is possible Refusing to impose a single standard

7 Edinburg March 2001CROSSMARC Kick-off meetingICDC Agents and circuits in TalLab messagesknowledge Accointances activity behavior persistence message box Agent Circuit of agents

8 Edinburg March 2001CROSSMARC Kick-off meetingICDC NLP techniques used in TalLab Tokenisation POS tagging Syntactic analysis Named Entity Recognition and Classification Semantic analysis Search engines Neural networks Finite state transducers Vector space model Statistical clustering

9 Edinburg March 2001CROSSMARC Kick-off meetingICDC Transistor-like agents Cardinality 1-N Cardinality N-1 Cardinality 1-1 Multiplier Dispatcher Switcher Filter TranslatorNetworker ConcentratorSynchronizer

10 Edinburg March 2001CROSSMARC Kick-off meetingICDC TalLab main features Malleability : plug & play architecture, easy prototyping Openness : reuse market components, low integration cost Efficiency: distribute applications, real-time, batch processing Exploitability: –Deployability full integration in the MIS –Reliability quality of service, robustness –Controllability monitoring facilities, surveillance tools

11 Edinburg March 2001CROSSMARC Kick-off meetingICDC Malleability Units of production = Circuits of agents Linking modules = Plugging agents

12 Edinburg March 2001CROSSMARC Kick-off meetingICDC Openness Integrating a component = Building a transducer Managing heterogeneity = Programming a translator

13 Edinburg March 2001CROSSMARC Kick-off meetingICDC Efficiency Pipeline architectureConcurrent architecture = Using multiplier

14 Edinburg March 2001CROSSMARC Kick-off meetingICDC Exploitability Deployability –distribution: sub-networks architecture –networkers: intranet proxies and internet firewalls Fiability –modularity: independence of agents –persistence: knowledge / message box / failures Controllability –uniformity: general controlling procedures –OS integration: connection to monitoring software

15 Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC technical expectations Adaptive techniques for information extraction from web pages Techniques for managing multilingual NLP- based applications Processing typical web texts (vs news items)

16 Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC applicative expectations Evaluation of the added-value of CROSSMARC in the context of CDC Exploitation of CROSSMARC by-products for competitive intelligence applications

17 Edinburg March 2001CROSSMARC Kick-off meetingICDC Intranet application at CDC Real-time news filtering and clustering –100 users, 100 topics Information retrieval –2 years of AFP economic news

18 Edinburg March 2001CROSSMARC Kick-off meetingICDC Internet application at CDC-Mercure Real-time news filtering –8,000 users, 80 topics

19 Edinburg March 2001CROSSMARC Kick-off meetingICDC IE prototype at CDC IE dedicated to shareholding events


Download ppt "Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST-2000-25366 Kick-off."

Similar presentations


Ads by Google