Published by Wesley Philip Peters. Modified over 8 years ago.
1
The Big ASC Working Lunch
Agenda
1. Status: Where are we?
2. Planning: New timeline
3. RAs: Recruitment and Training
4. Schedule for recording: cities, regional centres
5. Technical issues:
   Black Box hardware
   SSCP
   Data management and storage: size/transfer
   Audio/Video synchronisation
6. Recruitment and retention of speakers
7. Where are YOU at? Ethics, contribution, RA recruitment, recording room?
2
The Big ASC – Status
Budget: almost all contributions are in
Ethics:
   NEAF proposal completed and approved by UWS (lead site)
   Other sites getting their clearances (needed for recording)
Equipment:
   BB prototype BBB
   Hardware: finalised and ordered
   Server ready: http://bigasc.science.mq.edu.au/
   Software: recruited Lei Jing as our programmer (6 mths)
      1) Hardware management
      2) Session management
      3) User Interface
   A second 6-month position to work on database and server
3
The Big ASC – Status
Public face:
   Website up: austalk.edu.au
   Corpus name: AusTalk
   Logo
   Inviting Australian celebrities to record My Country (invitations sent out; probably January)
   Media launch: end of January
4
The Big ASC: Funding
ARC (LIEF)
   Original request: $1,296,525
   Obtained: $650,000 (~50%)
Contributions: $435,000
Total
   Original request: $1,731,525
   Obtained: $1,082,787 (~62.5%)
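The percentages on this slide can be reproduced with a few lines of arithmetic. The sketch below only restates the dollar figures shown above; the constant and function names are illustrative, not part of the project.

```python
# Funding figures as stated on the slide (AUD).
ARC_REQUESTED = 1_296_525
ARC_OBTAINED = 650_000
CONTRIBUTIONS = 435_000
TOTAL_REQUESTED = 1_731_525
TOTAL_OBTAINED = 1_082_787


def percent(obtained: int, requested: int) -> float:
    """Return the obtained amount as a percentage of the requested amount."""
    return 100 * obtained / requested


if __name__ == "__main__":
    # The total request is the ARC request plus the contributions.
    print("ARC request + contributions:", ARC_REQUESTED + CONTRIBUTIONS)
    print(f"ARC funding rate:   {percent(ARC_OBTAINED, ARC_REQUESTED):.1f}%")
    print(f"Total funding rate: {percent(TOTAL_OBTAINED, TOTAL_REQUESTED):.1f}%")
```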
5
The Big ASC: Funding
Each site receives:
1) 1 Black Box ($13.5K)
2) 2 days of training for RAs and IT support at UWS
3) Cash for data collection. For each participant recorded:
   2 hrs of RA time per visit: $92, for 3 visits = $276*
   Participant payment = $25 per visit x 3 = $75
   Participant bonus for finishing all three sessions = $15
   RA bonus for collecting a complete set of data for each speaker = $25
   Total = $391, rounded up to $400/speaker
4) Support for field trips to regional centres:
   Accommodation @ $100 p/n x 7 nights x 6 weeks
   Per diem @ $50 p/d x 7 days x 6 weeks
   Travel according to location
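The per-speaker figure and the field-trip support can be checked with a short calculation. This is a minimal sketch using only the rates listed above; the function names are hypothetical, the field-trip amount assumes 7 nights/days per week for 6 weeks and excludes travel, and the 48-speaker site in the usage lines is just an example.

```python
# Rates as stated on the slide (AUD).
RA_COST_PER_VISIT = 92            # 2 hrs of RA time per visit
VISITS = 3
PARTICIPANT_PAYMENT_PER_VISIT = 25
PARTICIPANT_BONUS = 15            # for finishing all three sessions
RA_BONUS = 25                     # for a complete data set per speaker
ROUNDED_RATE_PER_SPEAKER = 400    # the slide rounds $391 up to $400


def cost_per_speaker() -> int:
    """Exact per-speaker cost before rounding."""
    return (RA_COST_PER_VISIT * VISITS
            + PARTICIPANT_PAYMENT_PER_VISIT * VISITS
            + PARTICIPANT_BONUS
            + RA_BONUS)


def field_trip_support(weeks: int = 6, days_per_week: int = 7,
                       accommodation: int = 100, per_diem: int = 50) -> int:
    """Accommodation plus per diem for a field trip, excluding travel."""
    return weeks * days_per_week * (accommodation + per_diem)


def site_cash(speakers: int) -> int:
    """Data-collection cash for a site at the rounded per-speaker rate."""
    return speakers * ROUNDED_RATE_PER_SPEAKER


if __name__ == "__main__":
    print("Per speaker (exact):", cost_per_speaker())
    print("Field trip (excl. travel):", field_trip_support())
    print("Site with 48 speakers:", site_cash(48))
```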
6
The Big ASC: new timeline
                        start          end
Equipment & Training    started        30/06/2011
SSCP                    --             31/01/2011
BB                      --             31/01/2011
RA Training             Mid-February
Pilots                  Mid-February   End February
Data Collection         01/03/2011     15/06/2011
At main sites           01/03/2011     01/05/2011
Regions                 01/05/2011     15/06/2011
Data Storage            06/01/2011     29/06/2011
Data Annotation         24/02/2010     ----
7
The Big ASC: Data Collection (AusTalk)
STATE   UNIVERSITY  NUMBERS  REGIONAL      NUMBERS  OTHER       NUMBERS  TOTAL
NSW     UWS                  TOWNSVILLE    48
        MQ                   ARMIDALE      48
        UNSW        48                              EMOTION     36       84
        USYD        48                              DISORDERED  16       64
QLD     UQ          120
VIC     MELB        120      GEELONG       48                            168
SA      FLINDERS    96       ALICE/DARWIN  48       AUSAB       48       192
WA      UWA         96
TAS     UTAS        48
ACT     UC          36       BATHURST      48                            84
        ANU         48
TOTALS              660                    240                  100      1000
8
AusTalk: Corpus Components
   Isolated Words
   Digits
   Read Sentences
   Interview
   Story: reading/retelling
   Map task
   Yes/No
   Emotions
Purpose for each component? (Journal article)
9
The Big ASC: Standard Protocol
Task (Time)                  Session 1   Session 2   Session 3
Calibration + YES/NO         5           5           5
Words (HvDs/AusE/PolySyll)   10          10          10
Read Narrative               5
Interview                                15
Map Task (First run)                                 20
Re-told Narrative            10
Map Task (Second run)                                20
Read Digits                  5           5
Read Sentences               8           8
Total                        43                      55
10
Video data: compression?
Resolution: 640*480; 2 eyes: 2*640*480 = 614,400 pixels
Frames per second: 48
Pixel: 16bpp (16 bits per pixel, raw16)
For each second, for one stereo camera: 614,400*48*16 = 471,859,200 bits = 56.25 MB (taking 1 MB = 2^20 bytes)
Each hour is about 200 GB
3 one-hour sessions per speaker: 200 GB * 3 = 600 GB per speaker
1,000 speakers: 600 GB * 1000 = 600 TB for the corpus
Need 300 2 TB external hard disks
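The storage estimate above can be reproduced step by step. The following is a rough sketch under the slide's own assumptions (uncompressed raw16 video, one stereo camera, three one-hour sessions per speaker); the variable names are illustrative. The per-second figure uses binary units (1 MB = 2^20 bytes), and the slide's rounding of one hour to 200 GB is kept as an explicit constant.

```python
import math

# Camera parameters as stated on the slide.
WIDTH, HEIGHT = 640, 480
EYES = 2                 # stereo camera: two images per frame
FPS = 48
BITS_PER_PIXEL = 16      # raw16

# Uncompressed data rate for one stereo camera.
pixels_per_frame = EYES * WIDTH * HEIGHT
bits_per_second = pixels_per_frame * FPS * BITS_PER_PIXEL
bytes_per_second = bits_per_second // 8
mb_per_second = bytes_per_second / 2**20          # binary megabytes
gb_per_hour = bytes_per_second * 3600 / 2**30     # binary gigabytes

# The slide rounds one hour up to about 200 GB and works from there.
ROUNDED_GB_PER_HOUR = 200
SESSIONS_PER_SPEAKER = 3     # one hour each
SPEAKERS = 1000
DISK_SIZE_TB = 2

gb_per_speaker = ROUNDED_GB_PER_HOUR * SESSIONS_PER_SPEAKER
corpus_tb = gb_per_speaker * SPEAKERS / 1000      # 1 TB taken as 1000 GB here
disks_needed = math.ceil(corpus_tb / DISK_SIZE_TB)

if __name__ == "__main__":
    print(f"{pixels_per_frame:,} pixels per stereo frame")
    print(f"{bits_per_second:,} bits/s = {mb_per_second} MB/s")
    print(f"{gb_per_hour:.1f} GB per hour before rounding")
    print(f"{gb_per_speaker} GB per speaker, {corpus_tb:.0f} TB for the corpus")
    print(f"{disks_needed} disks of {DISK_SIZE_TB} TB")
```

The unrounded hourly figure comes out slightly under 200 GB, so the slide's totals are a mild overestimate for uncompressed video.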