EOVSA EST DPP Testing J. McTiernan EOVSA Prototype Review 24-Sep-2012.

Slides:

Advertisements

Similar presentations

CPU Structure and Function

Advertisements

Computer Architecture

Programmable Interval Timer

Computer Organization and Architecture

Processor System Architecture

Chapter 12 CPU Structure and Function. CPU Sequence Fetch instructions Interpret instructions Fetch data Process data Write data.

Computer Organization and Architecture

Mehmet Can Vuran, Instructor University of Nebraska-Lincoln Acknowledgement: Overheads adapted from those provided by the authors of the textbook.

Computer Organization and Architecture

Disk Access Model. Using Secondary Storage Effectively In most studies of algorithms, one assumes the “RAM model”: –Data is in main memory, –Access to.

FALL 2004CENG 351 Data Management and File Structures1 External Sorting Reference: Chapter 8.

CE6105 Linux 作業系統 Linux Operating System 許富皓. Chapter 2 Memory Addressing.

FALL 2006CENG 351 Data Management and File Structures1 External Sorting.

Chapter 5: Loops and Files.

Basic Input/Output Operations

Chapter 12 CPU Structure and Function. Example Register Organizations.

1 Programming & Programming Languages Overview l Machine operations and machine language. l Example of machine language. l Different types of processor.

Getting the O in I/O to work on a typical microcontroller Activating a FLASH memory “output line” Part 1 Main part of Laboratory 1 Also needed for “voice.

Mehmet Can Vuran, Instructor University of Nebraska-Lincoln Acknowledgement: Overheads adapted from those provided by the authors of the textbook.

‘C’ LANGUAGE PRESENTATION.  C language was introduced by Dennis Ritchie..  It is a programming language, which can make a interaction between user and.

INTERRUPTS PROGRAMMING

Group 5 Alain J. Percial Paula A. Ortiz Francis X. Ruiz.

1 Computer System Overview Chapter 1 Review of basic hardware concepts.

A COMPARISON MPI vs POSIX Threads. Overview MPI allows you to run multiple processes on 1 host  How would running MPI on 1 host compare with POSIX thread.

Moodle (Course Management Systems). Assignments 1 Assignments are a refreshingly simple method for collecting student work. They are a simple and flexible.

Khaled A. Al-Utaibi  Interrupt-Driven I/O  Hardware Interrupts  Responding to Hardware Interrupts  INTR and NMI  Computing the.

Fall 2012 Chapter 2: x86 Processor Architecture. Irvine, Kip R. Assembly Language for x86 Processors 6/e, Chapter Overview General Concepts IA-32.

Digital Packaging Processor Gordon Hurford Jim McTiernan EOVSA PDR 15-March-2012.

6 Memory Management and Processor Management Management of Resources Measure of Effectiveness – On most modern computers, the operating system serves.

External Storage Primary Storage : Main Memory (RAM). Secondary Storage: Peripheral Devices –Disk Drives –Tape Drives Secondary storage is CHEAP. Secondary.

1 Workshop Topics - Outline Workshop 1 - Introduction Workshop 2 - module instantiation Workshop 3 - Lexical conventions Workshop 4 - Value Logic System.

1 Linux Operating System 許富皓. 2 Memory Addressing.

Lecture 8 February 29, Topics Questions about Exercise 4, due Thursday? Object Based Programming (Chapter 8) –Basic Principles –Methods –Fields.

USB Project (15 th August) Ian Coulter. Last Week Needed to work error flags and error counting into labview. This seems to work successfully but doesn’t.

Moving Arrays -- 1 Completion of ideas needed for a general and complete program Final concepts needed for Final Review for Final – Loop efficiency.

Intel 8051 Another family of microcontroller is the Intel 8051 family. The basic 8051 microcontroller has four parallel input/output ports, port 0, 1,

VLBI_UDP An application for transferring VLBI data via UDP protocol Simon Casey e-VLBI meeting, Haystack 20 Sep 2006.

Loops and Files. 5.1 The Increment and Decrement Operators.

CompactLogix Project structure

EOVSA Pipeline Processing System J. McTiernan EOVSA Prototype Review 24-Sep-2012.

Memory Management OS Fazal Rehman Shamil. swapping Swapping concept comes in terms of process scheduling. Swapping is basically implemented by Medium.

Alternate Version of STARTING OUT WITH C++ 4 th Edition Chapter 5 Looping.

CSCI 156: Lab 11 Paging. Our Simple Architecture Logical memory space for a process consists of 16 pages of 4k bytes each. Your program thinks it has.

CS162 External Data Files 1 Today in CS162 External Files What is an external file? How do we save data in a file?

1 Device Controller I/O units typically consist of A mechanical component: the device itself An electronic component: the device controller or adapter.

Operating System Concepts with Java – 7 th Edition, Nov 15, 2006 Silberschatz, Galvin and Gagne ©2007 Chapter 0: Historical Overview.

Main Memory Cache Architectures

Microprocessor Systems Design I

Random access memory Sequential circuits all depend upon the presence of memory. A flip-flop can store one bit of information. A register can store a single.

Interrupts In 8085 and 8086.

Random access memory Sequential circuits all depend upon the presence of memory. A flip-flop can store one bit of information. A register can store a single.

Intro to Processes CSSE 332 Operating Systems

MapReduce Computing Paradigm Basics Fall 2013 Elke A. Rundensteiner

CENG2400 Revision Q1a A system has an ARM processor with a 32-bit General Purpose Input Output (GPIO) module. Two on/off switches are connected to bit-3.

Compiler Construction

Interrupts Interrupt is a process where an external device can get the attention of the microprocessor. The process starts from the I/O device The process.

Fundamentals of Data Structures

File I/O in C Lecture 7 Narrator: Lecture 7: File I/O in C.

Computer Architecture

Random access memory Sequential circuits all depend upon the presence of memory. A flip-flop can store one bit of information. A register can store a single.

CENG 351 Data Management and File Structures

ECE 352 Digital System Fundamentals

Chapter 11 Processor Structure and function

COS 518: Distributed Systems Lecture 11 Mike Freedman

Presentation transcript:

EOVSA EST DPP Testing J. McTiernan EOVSA Prototype Review 24-Sep-2012

Testing the DPP with EST data: The DPP is a multi-threaded process, using Open MP, written in C and FORTRAN. Input is correlator data, (state frame addition TBD). Output are visibility datasets in MIRIAD format. The different processing steps are run simultaneously, but can be thought of in sequence. Here is how the DPP test program for EST udp data (DPP_test_est12) works. Setup: There are four FORTRAN common block/C global external structures: Dppcorin holds the input packet data Pkt_register_in holds packet header information (packet number, accumulation number and packet address in the packet buffer Dppcorin.) Pkt_register holds packet info, sorted by packet number(0-8) and accumulation number. Procflags holds flags and pointers for buffer and packet buffer processing

Packet Buffer: N=NBUF*8 packets …….. Threads 1 and 2) Input, two streams of packet data. The streams are created from an IDL process that reads in one file, randomly decides whether a packet goes into one or another output files. These files are then used as inputs to the DPP_test_est12 program. The C routine est_read1 inputs packets from one file, est_read2 inputs from the other. Data from est_read1 is stored in the "top half", i.e., from 1 to N/2, data from est_read2 is stored in the "Bottom half", from N/2+1 to N. (This may change to even-odd in the future.) Packet header information is stored in another common block "pkt_register_in" in the same order as the input packets. Inputs are timed so that approximately 16 msec elapses for 1 accumulation. There is a flag associated with each position in pkt_register_in; it starts at value 255, is set to zero when a position is filled. When each half-buffer is filled, the processes loop back to the start. Pkt 3 Pkt 1 Pkt 2 Pkt 4 Pkt 5 Pkt 6 Pkt N

Thread 3) The subroutine dpp_fill_register orders the packets. It reads the pkt_register_in common block, starting with positons 1 and N/2+1, in input order, and the fills another common block, the "pkt_register" with packet information in order (i.e., for each accumulation number in order, packets 1 to 8 in order). There is a wait time associated with this step, (currently 100 msec). If a flag in the pkt_register_in does not go to zero during this time, then we move on.) There is a flag associated with each position in pkt_register; it starts at value 255, is set to zero when a position is filled. Also the flag for the packet_register_in is reset from 0 to 1, to denote that that packet has been ordered. For EOVSA, the state frame interpretation, assigning times and UV values to accumulation numbers could be included here. Once we have gone through the pkt_register_in buffer, we loop back to the start, and go again. Pkt_register_in: Pkt3Pkt4Pkt6Pkt2Pkt5Pkt7Pkt8 Pkt1 …

Pkt3Pkt4Pkt6Pkt2Pkt5Pkt7Pkt8 Pkt1Pkt2Pkt3Pkt4Pkt5Pkt6Pkt7Pkt8 Input 1:Input 2: Pkt_register_in to PKT_register: …

Thread 4) Now we have ordered packets, the next process, dpp_process_1, loops through the pkt_register and checks the flags for each accumulation, when all of the flags for an accumulation are zero, the packet addresses (The position in the packet buffer of the first 4-byte integer in the packet) are saved to an array. There is a wait time (currently 1 second) associated with this step. If the flags aren't all zero after this time, then we move to the next accumulation. Otherwise, we send the packet addresses to the subroutine DPP_PROCESS_DATAFRAME, G. Hurford's program that processes 1 accumulation. There is another flag associated with this processing. Starting at 255, if the process is successful, then the flag is set to 1. Otherwise the flag is set to 255. Pkt_register: Pkt2Pkt3Pkt4Pkt5Pkt6Pkt7Pkt8 Pkt1

Thread 5) The next process, dpp_process_2, loops through the process flag array and checks for a flag value of 1. There is a wait time (currently 1 second) associated with this step; if the flag isn't one after this time, then we move to the next accumulation. Otherwise, we call DPP_PROCESS_SPECTRAFRAME, G. Hurford's program that (currently) processes 1 accumulation, (but for EOVSA operations will process 1 second of data). If the process is successful, then the flag is set to 2. Otherwise the flag is set to 255. For EST data, the output program est_data_write, which calls MIRIAD routines to open and output the data, is called in this step. For EOVSA this will be farmed out to a separate process. Acc NBUF Pkt1 Flag Buffer: Size NBUF …… NBUF

Testing the DPP with EST data: This process currently works. At least it outputs uvlist-able output, and amplitudes and phases correspond to what is expected from examining the visibility output into text files. When multi-threading, it turns out that flagging is not sufficient to avoid segmentation faults. So in addition to a flag value, each process gets a pointer, to insure that the following process never catches up. There is no checking to see if processing of a given packet is finished before overwriting its position in the packet buffer. It's assumed that processing 20 msec of data will take less than 20 msec (hopefully much less). We can split processes if necessary; i.e., one process could do the even buffer positions, one odd, or 3 processes, where one does every third buffer position, etc… Each process has a done_flag, so that everybody can know when to stop. Each process checks the flag for the process before it before starting a new loop through the buffers. Processing ends when the last process has gone through the buffers twice after the previous process ends.

Hardware? The final version should have 7 threads, so a large number of processors is a good idea. The current version runs well on an 8-processor machine, (2 quad-core processors) but significant slowness occurred when expanding from 4 to 5 threads. We are guessing that is due to the fact that one of the quad-core processors was running system tasks, but really have no clue. In any case, we would like at least a 16 processor system (2 octo-core) with 32 to 64 Gbytes of RAM. (Note that my current desktop with 8 processors and 32 Gb RAM was $5000 in 2009).