Presentation is loading. Please wait.

Presentation is loading. Please wait.

Training Course on EDIT 2013

Similar presentations


Presentation on theme: "Training Course on EDIT 2013"— Presentation transcript:

1 Training Course on EDIT 2013
For Users

2 Outline of the module Introduction
Using EDIT - integration with other tools Objects in EDIT for Users EDIT Graphical User Interface

3 A - Introduction

4 EDIT is a tool for data validation and data editing/imputation
What is data validation? - An activity aimed at verifying whether the value of a data item comes from the given set of acceptable values: What is data editing? - The activity aimed at identifying erroneous entries and correcting them if necessary. Example: the response is missing or incorrect.

5 How EDIT works? Define a format
A format contains a description of the data in a dataset A dataset is a set of data according to a specific format Define a format Define a program containing rules and file operations to be executed on the dataset(s) Uploads dataset(s) from external sources (e.g. CSV files) For users Execute the job Get the report containing errors (if any)

6 EDIT roles 'User‘ - Executes programs on datasets and accesses the reports. 'Programmer‘ - Manages the metadata needed by the user to execute programs; Implements 'formats‘; Implements ‘validation rules’ by means of 'programs'; Defines other operations on files by means of 'programs'; Sets up the configuration (if needed) relating to automatic processing, validation flows, connection templates, etc. 'Administrator' Manages users, domains and permissions.

7 'User' functionalities ‘Change Password’ ‘Dataset Import/Export’
Allows users to change their password; ‘Dataset Import/Export’ Allows users to import and export data to and from EDIT as well as monitor any ongoing import/export processes; ‘Job Execution’ Allows users to execute programs on imported datasets and view/export the results of the execution.

8 The ‘user workflow’ Data Import Job Execution Job Results Data Export

9 The link between ‘user workflow' and ‘user interface'

10 Accepted dataset(s) formats
SDMX-ML GESMES CSV FLR

11 GESMES (BOP ITS, BOP FDI)
UNA:+.? ' UNB+UNOC:3+FR2+4D :1637+IREF GESMES/TS' UNH+MREF GESMES:2:1:E6' BGM+74' NAD+Z02+ECB' NAD+MR+4D0' NAD+MS+FR2' IDE+10+EUROSTAT_BOP_01 reporting' DSI+BOP_FDI_A' STS+3+7' DTM+242: :203' DTM+Z02: :702' IDE+5+EUROSTAT_BOP_01' GIS+AR3' GIS+1:::-' ARR++A:FR:N:2:330:N:4A:E:9999:9999: :702:0:A:F+0:A:F+0:A:F‘ ARR++A:FR:N:2:330:N:4F:E:9999:9999: :702:0:A:F+0:A:F+0:A:F' ARR++A:FR:N:2:330:N:7Z:E:9999:9999: :702:0:A:F+0:A:F+0:A:F' ARR++A:FR:N:2:330:N:A1:E:1100:9999: :702:5824:A:F+5930:A:F+4204:A:F' ARR++A:FR:N:2:330:N:A1:E:1495:9999: :702:5828:A:F+5932:A:F+4206:A:F' CSV (with or without header) (SBS, CVTS,TOURISM) 9H; 2008; LT; 2; B-N_X_K642; 11930; 16236; ; ; ; ; UNIT; ; ; ; ; ; TT0; ; ; ; ; D08 9H; 2008; LT; 3; B-N_X_K642; 11930; 1001; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08 9H; 2008; LT; 4; B-N_X_K642; 11930; 529; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08 9H; 2008; LT; 30; B-N_X_K642; 11930; 17766; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08 9H; 2008; LT; 2; B-E; 11930; 1138; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08 9H; 2008; LT; 3; B-E; 11930; 104; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08 9H; 2008; LT; 4; B-E; 11930; 61; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08 FLR example 1 001E 001E 001E 001E FLR example 2 E ZZZZZ E ZZZZZ

12 B - Using EDIT - integration with other tools

13 Ways of using EDIT As a web-based application – called by other applications; Standalone – running on a PC; Client – server – running in a Data Centre.

14 EDIT as Web-based application https://webgate. ec. europa
Web-based Interface Unified interface for both the standalone version and the server deployment EUROSTAT Look & Feel Light interface, simplified workflows ECAS account is needed PS Web-based access is not intended for confidential data

15 EDIT running standalone
Downloadable package; Standalone installation supported by Windows XP and Windows 7; Simple installation wizard; Full functionality; Standard authentication is requested.

16 Client - server mode for EDIT
EDIT runs on a UNIX machine; The current setup is EDIT installed at Eurostat & other DGs; Contains all registered domains (= user specific workspaces) as by default imbedded; ECAS credentials needed for external users.

17 EDAMIS integration EDAMIS allows transmitting data files through a single entry point; EDAMIS can send data to EDIT by placing the files in a configurable location; EDIT detects metadata based on the EDAMIS naming convention; EDIT performs the processing in unattended mode.

18 SDMX integration Statistical Data and Metadata Exchange (SDMX) initiative is sponsored by seven institutions (the BIS, the ECB, Eurostat, the IMF, the OECD, the UN and the World Bank); SDMX describes and universalises the way to exchange statistical data and metadata; EDIT can import SDMX-ML datasets.

19 C - Objects in EDIT for Users
Datasets Programmes, Jobs

20 1 - Datasets Dataset is a collection of data rows structured according to a format; A two dimensional table composed by rows and columns: Columns correspond to the fields defined in the format; Records – no limit on size or number.

21 Dataset example –AES (Adult Education Survey)

22 Example: 'Format' – 'Dataset'

23 The same format – different datasets

24 2 - Programs, Jobs Program – a set of operations to be performed on a dataset defined by a specific format; No specific dataset is associated with a program, only formats (dataset definitions) should be specified; Job – the association between a 'Program' and concrete 'Dataset(s) Instance(s)'; Possible operations types of rules/checks: Single and Multiple column(s), Vertical and Hierarchical.

25 Job: error reports It contains:
View dataset– information about the job View statistics – summary of errors/statistics View detailed statistics report – downloadable excel file containing the summary of errors Export – new dataset containing errors can be exported here

26 Error report Error report is made up of errors contained in the imported dataset. Among other information, the following can be found: Rule name: The name of the program rule that failed; No of Failures: Individual rows that the error appeared through job execution; Rule Message: Rule’s error message as defined in the program.

27 Error report – view dataset

28 Error report - View statistics

29 Error report – View detailed statistics report

30 D - EDIT GRAPHICAL USER INTERFACE

31 EDIT - Log in (through ECAS)

32 Web-based access – not intended for confidential data

33 EDIT Home page EDIT 2013 User Manual EDIT Concepts (Tabs)
Your role in EDIT

34 Import dataset – file import
Locate your dataset Name your dataset Dataset Predefined information

35 Advanced configuration (I part)
If it is empty Click here to configure Threshold (can be changed) Choose among GESMES/ CSV/ FLR/ SDMX Fill in the information accordingly Properties can be saved (to be reused)

36 Advanced configuration (II part)
Select Format Reuse saved selection Click here when you are ready to import

37 Importing a CSV file with a header

38 Header definition meaningless
Fill in the parameters according to specificities of the dataset The order of the variables in the dataset has to be exactly the same as in the selected fields

39 Header definition meaningful
Fill in the parameters according to specificities of the dataset The order of the variables in the dataset DOES NOT need to match that of the selected fields box

40 Click on the triangle to see information about rejected cases
Import - failed Click on the triangle to see information about rejected cases Import status FAILED

41 Import– successful with warnings
Click on the triangle to see information about rejected cases Import status completed

42 Click here for imported dataset Import status completed
Import - successful Click here for imported dataset Import status completed

43 Hide columns from the showing dataset
Imported dataset Hide columns from the showing dataset Imported dataset

44 Fill in the information as needed and click on search
Search Dataset(s) Fill in the information as needed and click on search Export in CSV /FLR View dataset Delete dataset

45 Import / Export dataset(s)
Search criteria View, delete and download datasets

46 Create a Job – (I) choose a Job
Search criteria Available programs Launch, view (program) or export a specific job

47 Create a Job – (II) select parameters
Choose the appropriate dataset Rename (or not) the error log Click Execute Job

48 Create Job –(III) Job results
Search criteria View, delete & copy job(s)

49 Job detailed information View / export specific dataset
View Job details Job detailed information View error report/statistics/ detailed statistics / export error report View / export specific dataset

50 View error report information that can be hidden
Displaying error cases in the form of a error dataset

51 Validation rule which failed in the previous cases
RECORD SEX { CONDITION in(SEX,"MALES","FEMALES","TOTAL"); ERRMSG 'Value must match one of the codes' SEVERITY 'E' (SEX) ;

52 View Job statistics

53 View Job detailed statistics report – a downloadable file

54 Export error report (using CSV/FLR)
Choose file type & fill in the required parameters accordingly Hit export

55 View program (available in Create Job)

56 View Job statistics

57 Fill in the appropriate parameters Get the corresponding Job list
Search Job Fill in the appropriate parameters Get the corresponding Job list

58 Run validation flow – step I
A validation flow may be available

59 Run validation flow – step II
Locate dataset Name dataset Click start

60 Validation flow in progress

61 Run validation flow – step III
Job Details where error reports are accessible (same window as in View Job Details)

62 Useful links To EDIT page: http://ec.europa.eu/eurostat/edit
To VIPv page: CIRCAbc -> Eurostat -> VIP Validation Project Generic data validation and editing service: mailto: EDIT as web – client - CIRCAbc for: EHSIS: BoP: BoP:

63 Thank you for your attention!
Questions? Thank you for your attention!


Download ppt "Training Course on EDIT 2013"

Similar presentations


Ads by Google