Writing up.

Outline. The scope of a paper; storytelling; paper organization; mathematics; algorithms.

The scope of a paper. Which results are the most surprising? What is the one result that other researchers might adopt in their work? Does it make sense to explain the new algorithms first, followed by a description of the previous algorithms in terms of how they differ from the new work? Or is the contribution of the new work more obvious if the old approaches are described first, to set the context? What is the key background work that has to be discussed? Who is the readership? For example, are you writing for specialists in your area, your examiners, or a general computer science audience? What assumptions or definitions need to be formalized before the main theorem can be presented? Are the other outcomes independent enough to be published separately later on? Are they interesting enough to justify their being included?

The scope of a paper. Example: an investigation of external sorting in database systems. A large relation (tens of millions of records, constituting several gigabytes) must be sorted on a field specified in a query. Costs include processing time for sorting and merging, transfer time to and from disk, and temporary space requirements. The balance between these costs is governed by available in-memory buffer space, as large blocks are expensive to sort but cheap to merge.
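
The sort-then-merge structure behind this trade-off can be sketched in a few lines of Python. This is only an illustrative sketch, not the method of any particular paper: the names sorted_runs, external_sort, and buffer_size are assumptions, and in-memory lists stand in for the temporary files a real system would write to disk.

```python
import heapq
from itertools import islice
from typing import Iterable, Iterator, List


def sorted_runs(records: Iterable[int], buffer_size: int) -> Iterator[List[int]]:
    """Cut the input into runs of at most buffer_size records and sort each run."""
    it = iter(records)
    while True:
        run = list(islice(it, buffer_size))
        if not run:
            return
        run.sort()  # in-memory sort: more expensive for larger runs
        yield run


def external_sort(records: Iterable[int], buffer_size: int) -> List[int]:
    """Sort by merging sorted runs (lists stand in for temporary disk files)."""
    runs = list(sorted_runs(records, buffer_size))
    # k-way merge: fewer, larger runs make this step cheaper
    return list(heapq.merge(*runs))


data = [9, 3, 7, 1, 8, 2, 6, 4, 5, 0]
result = external_sort(data, buffer_size=4)
```

A larger buffer_size gives fewer runs to merge but more sorting work per run, which is the balance the slide describes.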

The content of a paper is determined by the readership. A paper on machine learning for computer vision may have entirely different implications for the two fields, and thus different aspects of the results might be emphasized. An expert on vision cannot be assumed to have any experience with machine learning.

The publication venue determines the scope of the paper. Is there a page limit? Are there specific conventions to be observed? Are the other papers in that venue primarily theoretical or experimental? What prior knowledge or background is a reader likely to have? Do the editors require that your code be available online?

Telling a story. A paper is a sequence of concepts, building from a foundation of knowledge assumed to be common to all readers up to new ideas and results. There are several common ways of structuring the body of a paper, including as a chain, by specificity, by example, and by complexity.

Telling a story. Example: a paper on compression for fast external sorting. The problem statement consists of an explanation of external sorting and an argument that disk access costs are a crucial bottleneck. The review explains standard compression methods and why they cannot be integrated into external sorting. The new solution is the compression method developed in the research. The demonstration is a series of graphs and tables based on experiments that compare the cost of sorting with and without compression.

Telling a story. Structure by specificity. Structure by example: an explanation of a retrieval system. Such systems generally have several components: parsing, indexing, query, … Structure by complexity.

Organization. Describe the work in the context of accepted scientific knowledge. State the idea that is being investigated, often as a theory or hypothesis. Explain what is new about the idea, what is being evaluated, or what contribution the paper is making. Justify the theory, by methods such as proof or experiment.

Organization. Title and author; abstract; introduction; body; literature review; conclusions; bibliography; appendices.

Body. A common fixed structure is Introduction-Methods-Results-Discussion, but use of fixed headings may prohibit development of a complex explanation in stages. Example headings for "compression for external sorting": 1. Introduction; 2. External sorting; 3. Compression techniques for database systems; 4. Sorting with compression; 5. Experimental setup; 6. Results and discussion; 7. Conclusions.

Literature review. A literature review, or survey, is used to compare the new results to similar previously published results, to describe existing knowledge, and to explain how it is extended by the new results. In many papers the literature review material is not gathered into a single section, but is discussed where it is used.

From draft to submission. Brainstorm, writing down in point form what has been learnt, what has been achieved, and what the results are. Prepare a skeleton, choosing results to emphasize and discarding material that on reflection seems irrelevant. Choose the section titles before writing any text. When the structure is complete, each section can be sketched in perhaps 20 to 200 words.

From draft to submission. When the body and the closing summary are complete, the introduction usually needs substantial revision. With a reasonably thorough draft completed, it is time to review the paper's content and contribution. For a novice writer who doesn't know where to begin, a good starting point is imitation.

Mathematics

Mathematical clarity. Mathematics gives solidity to abstract concepts. There are well-established conventions of presentation for mathematics and mathematical concepts. In mathematical writing it is essential to be precise.

Clarity. X An inverted list for a given term is a sequence of pairs, where the first element in each pair is a document identifier and the second is the frequency of the term in the document to which the identifier corresponds. √ An inverted list for a term t is a sequence of pairs of the form (d, f), where each d is a document identifier and f is the frequency of t in d.
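
The (d, f) definition maps directly onto a small data structure. The following Python sketch is illustrative only; the function name inverted_lists, the whitespace tokenization, and the two sample documents are assumptions made for the example.

```python
from collections import Counter
from typing import Dict, List, Tuple


def inverted_lists(docs: Dict[int, str]) -> Dict[str, List[Tuple[int, int]]]:
    """Map each term t to its inverted list: pairs (d, f), ordered by d."""
    index: Dict[str, List[Tuple[int, int]]] = {}
    for d in sorted(docs):
        # f is the frequency of term t in document d
        for t, f in Counter(docs[d].lower().split()).items():
            index.setdefault(t, []).append((d, f))
    return index


docs = {1: "to be or not to be", 2: "to sort or to merge"}
index = inverted_lists(docs)
```

With the variables named as in the precise definition, the code and the prose can be checked against each other symbol by symbol.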

Mathematical terms. Normal, usual. Definite, strict, proper, all, some: avoid "definite", "strict", and "proper" in their non-mathematical meanings, and be careful with "all" and "some". Intractable: an algorithm or problem is "intractable" only if it is NP-hard. Formula, equation: a "formula" is not necessarily an "equation"; the latter involves an equality. Equivalent, similar. Average, mean: "average" is used loosely to mean typical. Only use it in the formal sense, of arithmetic mean, if it is clear to the reader that the formal sense is intended. Otherwise use "mean" or even "arithmetic mean".

Theorems. The details of the proof may not be important to the reader and can often be omitted. A common mistake is to unnecessarily include mechanical algebraic transformations. Theorems, definitions, lemmas, and propositions should be numbered.

Theorems. State the main theorem first, then state and prove the lemmas before giving the main proof. Explain the structure of long proofs before getting to the detail, and explain how each part of the proof relates to the structure.

Readability (1). A sentence begins with a capital letter; mathematics does not, and so should not be used at the start of a sentence. Give the type of each variable every time it is used, so that the reader doesn't have to remember as many details. X The values are represented as a list of numbers L. √ The values are represented as a list L of numbers.

Readability (2). Consider breaking down expressions to make them more readable, especially if doing so enlarges small symbols. Mathematical expressions should not run together.

Notation. Ensure that the symbols you use will be correctly understood by, and familiar to, the reader. The symbols ∽ and ≈ are used to mean "approximately equal to". The symbol ≌ means "is congruent to", not "approximately equal to". Use ≤, not <=, for "less than or equal to".

Ranges and sequences. Range for real numbers. Range for integers: [a,b], [a,b), … It is common practice to use an ellipsis to describe a sequence of integers; thus m,...,n represents all integers between m and n inclusive.

Alphabets. Use of characters from the Greek alphabet to denote variables and quantities can add clarity to mathematical writing. Some mathematical symbols and characters from other alphabets have a superficial resemblance to more familiar symbols.

Line breaks

Numbers. In technical writing, numbers should usually be written as figures, not spelt out. The common exceptions are: approximate numbers; numbers up to twenty, unless they are literal values or part of an expression of measurement; and numbers at the start of a sentence, although it is generally better to recast the sentence so that the number is elsewhere. Percentages should always be in figures.

Numbers. X 1024 computers were linked into the ring. √ There were 1024 computers linked into the ring. X Partial compilation gave a 4-fold improvement. √ Partial compilation gave a four-fold improvement. X The increase was over five per cent. √ The increase was over 5 per cent. √ The increase was over 5%. X The method requires 2 passes. √ The method requires two passes.

Numbers, percentages. X There were between four and 32 processors in each machine. √ There were between 4 and 32 processors in each machine. X There were 14 512-Kb sets. √ There were fourteen 512-Kb sets. Avoid the phrase "orders of magnitude". X The new algorithm is at least two orders of magnitude faster. √ The new algorithm is at least a hundred times faster.

Numbers, percentages. In the "orders of magnitude" example, is the unit of magnitude binary or decimal? It would be better to be explicit. "There are 10 kinds of people in the world, those that understand binary and those that don't." X The likelihood of failure is 2:1. √ The likelihood of failure is one in three. √ The likelihood of failure is about 30%.

Units of measurement. The larger units, especially "Pb", "Eb", "Zb", and "Yb", are unfamiliar to most readers and should be written in full at least once, preferably with an explanation.

Algorithms

Presentation of algorithms. You must demonstrate that the algorithm is a worthwhile contribution; show that it is correct (given appropriate input, it terminates with appropriate results); and show, by proof, experiment, or both, that it meets some claimed performance bound.

Presentation of algorithms. A presentation should include the following. The steps that make up the algorithm. The input and output, and the internal data structures used by the algorithm. The scope of application of the algorithm and its limitations. The properties that will allow demonstration of correctness, such as preconditions, postconditions, and loop invariants. A demonstration of correctness. A complexity analysis, for both space and time requirements. Experiments confirming the theoretical results.
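
As an illustration of stating preconditions, postconditions, and loop invariants alongside the steps, here is a minimal Python sketch using binary search. The choice of algorithm and the function name binary_search are assumptions for the example; the deck does not name a specific algorithm at this point.

```python
from typing import Sequence


def binary_search(a: Sequence[int], x: int) -> int:
    """Return an index i with a[i] == x, or -1 if x does not occur in a."""
    # Precondition: a is sorted in non-decreasing order.
    assert all(a[i] <= a[i + 1] for i in range(len(a) - 1))
    lo, hi = 0, len(a)
    while lo < hi:
        # Loop invariant: if x occurs in a, it occurs in a[lo:hi].
        mid = (lo + hi) // 2
        if a[mid] < x:
            lo = mid + 1
        elif a[mid] > x:
            hi = mid
        else:
            # Postcondition (found): a[mid] == x.
            return mid
    # Postcondition (not found): a[lo:hi] is empty, so by the invariant
    # x does not occur in a.
    return -1
```

Termination follows because hi - lo strictly decreases on every iteration, and the search takes O(log n) comparisons once the precondition is known to hold (the assertion itself is a linear-time check included only for illustration).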

Formalisms. Common formalisms for presenting algorithms are the list style, in which the algorithm is broken down into a series of numbered or named steps and loops, and pseudocode, in which the algorithm is presented as if written in a block-structured language. A better option is to use what might be called prosecode: number each step, never break a loop over several steps, use sub-numbering for the parts of a step, and include explanatory text.

pseudocode

Prosecode. WeightedEdit(s,t) compares two strings s and t, of lengths k_s and k_t respectively, to determine the edit distance: the minimum cost in insertions, deletions….
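
The slide's description of WeightedEdit is cut short, so the following Python sketch is not a reconstruction of it. It shows the standard dynamic-programming computation of a weighted edit distance, with hypothetical cost parameters ins, dele, and sub (insertion, deletion, substitution) chosen for illustration.

```python
from typing import List


def weighted_edit(s: str, t: str, ins: float = 1.0,
                  dele: float = 1.0, sub: float = 1.0) -> float:
    """Minimum total cost of edits that transform s into t (illustrative costs)."""
    ks, kt = len(s), len(t)
    # F[i][j] is the minimum cost of transforming s[:i] into t[:j].
    F: List[List[float]] = [[0.0] * (kt + 1) for _ in range(ks + 1)]
    for i in range(1, ks + 1):
        F[i][0] = i * dele  # delete the first i characters of s
    for j in range(1, kt + 1):
        F[0][j] = j * ins   # insert the first j characters of t
    for i in range(1, ks + 1):
        for j in range(1, kt + 1):
            replace = 0.0 if s[i - 1] == t[j - 1] else sub
            F[i][j] = min(F[i - 1][j] + dele,        # delete s[i-1]
                          F[i][j - 1] + ins,         # insert t[j-1]
                          F[i - 1][j - 1] + replace)  # match or substitute
    return F[ks][kt]
```

In a paper, the same content would be given as numbered prose steps with the recurrence stated mathematically; the code form is used here only so that the steps can be run and checked.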

Level of detail

Notation. Mathematical notation is preferable to programming notation for presentation of algorithms. Use "x_i" rather than "x[i]". Mathematics provides many handy conventions and symbols that can be used in the description of algorithms, including set notation, subscripts, and superscripts.

Environment of algorithms. The environment includes the data structures on which the algorithm operates, the input and output data types, and factors such as properties of the underlying operating system and hardware. Describe data structures carefully. Use, say, a simple mathematical notation to unambiguously specify the structure. √ Each element is a triple (string, length, positions) in which positions is a set of byte offsets at which string has been observed.
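
The triple in the example can be mirrored in code once it has been specified this precisely. The sketch below is a hypothetical Python rendering; the class name Element and the helper observe are assumptions introduced for illustration.

```python
from typing import FrozenSet, NamedTuple


class Element(NamedTuple):
    """The triple (string, length, positions) from the specification."""
    string: bytes
    length: int
    positions: FrozenSet[int]  # byte offsets at which string has been observed


def observe(text: bytes, string: bytes) -> Element:
    """Collect every byte offset in text at which string occurs."""
    positions = set()
    start = text.find(string)
    while start != -1:
        positions.add(start)
        start = text.find(string, start + 1)  # allow overlapping occurrences
    return Element(string, len(string), frozenset(positions))


element = observe(b"abracadabra", b"abra")
```

Because the specification says positions is a set, the code uses an unordered frozenset; a description that left this open might be implemented as a list, with different duplicate and ordering behavior.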

Performance of algorithms. Basis of evaluation; processing time; memory and disk requirements; disk and network traffic; power and energy consumption.

Performance of algorithms. Basis of evaluation: the basis of evaluation should be made explicit. Processing time: time (or speed) over some given input is one of the principal resources used by algorithms. Memory and disk requirements. Disk and network traffic. Applicability: algorithms can be compared not only with regard to their resource requirements but also with regard to functionality.

Asymptotic complexity. Big-O notation: a function f(n) is said to be O(g(n)), that is, g(n) is an upper bound of f(n), if for some constants c and k we have f(n) ≤ c · g(n) for all n > k.
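
A short worked instance may make the roles of the constants c and k concrete. The function f(n) = 3n^2 + 5n is an arbitrary choice for illustration, as are the witnesses c = 4 and k = 5; many other choices would work equally well.

```latex
\begin{align*}
f(n) &= 3n^2 + 5n, \qquad g(n) = n^2, \qquad c = 4, \quad k = 5. \\
\text{For all } n > 5:\quad 5n &\le n \cdot n = n^2, \\
\text{so}\quad f(n) = 3n^2 + 5n &\le 3n^2 + n^2 = 4n^2 = c \cdot g(n).
\end{align*}
```

Hence f(n) is O(n^2) under the definition above.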

Asymptotic complexity. If f(n) is O(g(n)) and g(n) is O(f(n)), then f(n) is Θ(g(n)). A certain algorithm might require O(n log n) comparisons and O(n) disk accesses. In principle the complexity of the algorithm is O(n log n), but, given that a disk access may require 5 milliseconds and a comparison less than a nanosecond, in practice the cost of the disk accesses might well dominate for any possible application.
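
The claim about disk accesses dominating can be checked with rough arithmetic. The Python sketch below assumes exactly n log2 n comparisons and n disk accesses (that is, hidden constants of 1, which is an assumption), a comparison cost of 1 nanosecond as an upper estimate, and the 5 millisecond disk access from the text.

```python
import math
from typing import Tuple

COMPARISON_SECONDS = 1e-9   # assumed upper estimate for one comparison
DISK_ACCESS_SECONDS = 5e-3  # 5 milliseconds per disk access, as in the text


def estimated_costs(n: int) -> Tuple[float, float]:
    """Return (comparison time, disk time) in seconds for input size n."""
    comparisons = n * math.log2(n)  # assumes a constant factor of 1
    accesses = n                    # assumes a constant factor of 1
    return comparisons * COMPARISON_SECONDS, accesses * DISK_ACCESS_SECONDS


cpu, disk = estimated_costs(10**9)
# Comparisons cost more than disk accesses only when log2(n) exceeds this ratio.
crossover_log2_n = DISK_ACCESS_SECONDS / COMPARISON_SECONDS
```

Under these assumptions a billion records need about 30 seconds of comparisons against roughly 5 million seconds of disk accesses, and the comparison term overtakes the disk term only once log2 n exceeds 5,000,000, far beyond any realizable input.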

The logic of asymptotic claims. Amdahl's law states that the lower bound for the time taken for an algorithm to complete is determined by the part of the algorithm that is inherently sequential. It has been claimed that Amdahl's law was broken by, for a certain algorithm, increasing both the size of the input data and the number of processors. These changes had minimal impact on the sequential part of the algorithm, so the sequential fraction of the work shrank as the problem grew; the law, which concerns a fixed problem, was not contradicted.
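
The usual formula for Amdahl's law can be used to see why growing the input does not break it. In this Python sketch the function names and the sample numbers (a fixed sequential time of 1 unit, parallelizable work of 9 or 9999 units, 1000 processors) are assumptions chosen for illustration.

```python
def amdahl_speedup(serial_fraction: float, processors: int) -> float:
    """Speedup on a fixed problem whose inherently sequential share is serial_fraction."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / processors)


def speedup_from_times(serial_time: float, parallel_work: float,
                       processors: int) -> float:
    """Speedup computed directly from sequential time and parallelizable work."""
    one_processor = serial_time + parallel_work
    many_processors = serial_time + parallel_work / processors
    return one_processor / many_processors


# Small input: 10% of the work is sequential, so speedup stays below 10.
small = speedup_from_times(1.0, 9.0, 1000)
# Larger input, same sequential part: the sequential fraction is now 0.01%.
large = speedup_from_times(1.0, 9999.0, 1000)
```

The larger speedup is exactly what the law predicts for the smaller sequential fraction, so the observation is consistent with the law and says nothing against it.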

Sometimes a formal analysis is inappropriate or only a minor consideration. Analytical results often say nothing about constant factors or about behavior in practice, where the CPU and cache can interact in unpredictable ways.