Presentation is loading. Please wait.

Presentation is loading. Please wait.

Project Data Flow.

Similar presentations


Presentation on theme: "Project Data Flow."— Presentation transcript:

1 Project Data Flow

2 SAMBAH - a case study Is the Baltic harbour porpoise extinct?
One Data Manager - Daniel Wennerberg 9 countries and languages. Multiple Lat/Long formats from ships (see freeware on Chelonia website) 400 years of data from >300 C-PODs. 29 billion clicks, 4 billion porpoise-like clicks 6 million clicks in trains from the Hel1 classifier. <1 false positive sec/year Outcome: ? designation of 3 marine conservation zones. 

3 this scheme is the best! File naming
Island SE FPOD_0004 file0.FP1 this scheme is the best! Location date of file start POD identifier file number type this deployment Keep all files in one directory Use stable location names If you want to use the array viewer there are rules on the first 3 characters Let CPOD.exe construct the name then File lists in Windows can be sorted by place, deployment date, POD number, or by file type, or file date Batch exports are quick and easy

4 Archiving Check length of CP1 or FP1 after reading SD card. Does it match the deployment? More checks if it doesn't match… Archive all CP1 / FP1 files, plus metadata spreadsheet. That's the minimum. CP3 / FP3 files with detections can be regenerated later if needed.

5 File listing FileName POD starts ends startMin endMin Logdays
Muara Pahu POD1600 file01.CP1 1600 25/11/ :55 29/01/ :16 64.64 Muara Muntai POD2840 file01.CP1 2840 25/11/ :32 03/02/ :17 70.41 Muara Kedang Kepala POD2837 file02.CP1 2837 29/12/ :22 31/01/ :18 33.75 Muara Kedang Kepala POD2837 file01.CP1 21/11/ :24 29/12/ :44 38.22 Muara Kaman POD2836 file01.CP1 2836 23/11/ :30 29/01/ :58 67.15 Pela River Mouth POD2839 file01.CP1 2839 23/11/ :14 30/01/ :49 67.9 Muara Belayan POD2838 file02.CP1 2838 31/12/ :27 26/01/ :29 25.79 Muara Belayan POD2838 file01.CP1 21/11/ :26 31/12/ :26 40.25 Mahakam POD1600 file02.CP1 25/09/ :29 08/10/ :16 13.2 Mahakam POD1600 file01.CP1 04/08/ :22 25/09/ :28 51.34

6 Metadata Keep a spreadsheet of all deployments: place, POD start time, deployment time, mooring type, local conditions. Use the data fields in the files. Can be exported. to change embedded data

7 Checking files after reading the file from the SD card, look at the whole file: for discontinuities in sound, temperature, angles

8 Backup Backup Backup Backup Backup BackupB
…if you don't have 3 copies of your data, you don't have data Aaargh

9 Cropping files after reading file from SD card

10 Cropping files Select point after the boat noise has gone
Right-click pop-up menu Set selection start go to end … Set selection end or use automated cropping … crops to midnight

11 Cropping files copy & crop Kawda, Sarjekot POD1592 file01 PART 206d 5h 28m.CP1 then save cropped file in 'Useful Files' folder for project

12 Kawda, Sarjekot 2016 05 10 POD1592 file01 PART 206d 5h 28m.CP1
Kawda, Sarjekot POD1592.CP3

13 Folders and processing
Archive folder : all CP1 files. Working folder : 'Useful files': all useful CP1, CP3, cm1, cm3 files cm1, cm3 files are essential map files. They can be safely deleted, but will be rebuilt when required! Time selections can now be used for other purposes Re-run train detection

14 Train detection The big one! Does multiple hypothesis testing. Slow.
Encounter classifiers can be run and re- run without repeating KERNO … if there cannot be any NBHF species e.g. if there are VEMCO fish tags

15 Decide on encounter classifier
KERNO - widely used for porpoise studies. Often gives low sensitivity for dolphins. GENENC - often gives better ( x2) sensitivity for dolphins and reduces false NBHF detections from dolphins at a cost of slightly reducing NBHF sensitivity. Hel1 - only detects porpoises. Assumes no dolphins occur, so it may give false positives from dolphins. Developed specifically for the Baltic using training data from the Hel Marine Station, Poland. KERNO-F - under development. Uses F-POD data. Wide range of training data needed. Please give generously! Choice based on validation of a sample of detections.

16 Validation tools Inspect trains - the key test - see 'Validating detections' Analysis page - shows the impact over the whole file of different criteria

17

18 DATA issues Difficult issues:
see Choice of Filters and Stats Difficult issues: How much of the time are dolphins silent? Can dolphin group size be determined from acoustics? Is bottom feeding missed in behavioural studies? Diel patterns of detectability - all species. Seasonal change in this? Deployment depth effects. Halocline, thermocline ( known together as the pycnocline) effects? Few parameters are normally distributed. Do you need a detection function? Or can you use a trend?

19 thank you for your attention


Download ppt "Project Data Flow."

Similar presentations


Ads by Google