Kathleen Fisher Tufts University Thanks to Simon Peyton Jones for many of these slides. References: “Real World Haskell”, Chapter 0 & 7 (http://book.realworldhaskell.org/)http://book.realworldhaskell.org/

Slides:



Advertisements
Similar presentations
Kathleen Fisher cs242 Reading: Tackling the Awkward Squad, Sections 1-2Tackling the Awkward Squad Real World Haskell, Chapter 7: I/OReal World Haskell.
Advertisements

Introduction to Compilation of Functional Languages Wanhe Zhang Computing and Software Department McMaster University 16 th, March, 2004.
Adapted from Scott, Chapter 6:: Control Flow Programming Language Pragmatics Michael L. Scott.
Kathleen Fisher cs242 Reading: “A history of Haskell: Being lazy with class”,A history of Haskell: Being lazy with class Section 6.4 and Section 7 “Monads.
Getting started with ML ML is a functional programming language. ML is statically typed: The types of literals, values, expressions and functions in a.
Functional Programming Universitatea Politehnica Bucuresti Adina Magda Florea
The IO Monad Mooly Sagiv Slides from John Mitchell Kathleen Fisher Simon Peyton Jones Reading: “Tackling the Awkward Squad” “Real World Haskell,” Chapter.
MATHEMATICAL FOUNDATIONS SUPPLEMENTAL MATERIAL – NOT ON EXAM.
Kathleen Fisher cs242 Reading: “Tackling the Awkward Squad,” Sections 1-2Tackling the Awkward Squad “Real World Haskell,” Chapter 7: I/OReal World Haskell.
A Lightning Tour of Haskell Lecture 1, Designing and Using Combinators John Hughes.
CS 280 Data Structures Professor John Peterson. Lexer Project Questions? Must be in by Friday – solutions will be posted after class The next project.
Type Inference: CIS Seminar, 11/3/2009 Type inference: Inside the Type Checker. A presentation by: Daniel Tuck.
CS510AP Quick Check Monads. QuickCheck Quick check is a Haskell library for doing random testing. You can read more about quickcheck at –
0 PROGRAMMING IN HASKELL Chapter 9 - Interactive Programs.
0 PROGRAMMING IN HASKELL An Introduction Based on lecture notes by Graham Hutton The book “Learn You a Haskell for Great Good” (and a few other sources)
0 PROGRAMMING IN HASKELL Chapter 11 - Interactive Programs, Declaring Types and Classes.
Dr. Muhammed Al-MulhemICS Chapter 5 The Algol Family and Haskell.
Haskell. 2 GHC and HUGS Haskell 98 is the current version of Haskell GHC (Glasgow Haskell Compiler, version 7.4.1) is the version of Haskell I am using.
Haskell Chapter 8. Input and Output  What are I/O actions?  How do I/O actions enable us to do I/O?  When are I/O actions actually performed?
1 COMP313A Functional Programming (1). 2 Main Differences with Imperative Languages Say more about what is computed as opposed to how Pure functional.
Comp150PLD Reading: “Tackling the Awkward Squad,” Sections 1-2Tackling the Awkward Squad “Real World Haskell,” Chapter 7: I/OReal World Haskell Thanks.
Deriving Combinator Implementations Lecture 4, Designing and Using Combinators John Hughes.
CSC 580 – Theory of Programming Languages, Spring, 2009 Week 9: Functional Languages ML and Haskell, Dr. Dale E. Parson.
Cohesion and Coupling CS 4311
Week 3 - Wednesday.  What did we talk about last time?  Other C features  sizeof, const  ASCII table  printf() format strings  Bitwise operations.
© M. Winter COSC 4P41 – Functional Programming Programming with actions Why is I/O an issue? I/O is a kind of side-effect. Example: Suppose there.
COMP313A Functional Programming (1)
0 Odds and Ends in Haskell: Folding, I/O, and Functors Adapted from material by Miran Lipovaca.
Arvind Computer Science and Artificial Intelligence Laboratory M.I.T. L13-1 October 26, 2006http:// Monadic Programming October.
Fall 2002CS 150: Intro. to Computing1 Streams and File I/O (That is, Input/Output) OR How you read data from files and write data to files.
Lee CSCE 314 TAMU 1 CSCE 314 Programming Languages Interactive Programs: I/O and Monads Dr. Hyunyoung Lee.
CMSC 330: Organization of Programming Languages Operational Semantics a.k.a. “WTF is Project 4, Part 3?”
CMSC 330: Organization of Programming Languages Operational Semantics.
24-Jun-16 Haskell Dealing with impurity. Purity Haskell is a “pure” functional programming language Functions have no side effects Input/output is a side.
1 PROGRAMMING IN HASKELL An Introduction Based on lecture notes by Graham Hutton The book “Learn You a Haskell for Great Good” (and a few other sources)
6-Jul-16 Haskell II Functions and patterns. Data Types Int + - * / ^ even odd Float + - * / ^ sin cos pi truncate Char ord chr isSpace isUpper … Bool.
1 ENERGY 211 / CME 211 Lecture 3 September 26, 2008.
CS314 – Section 5 Recitation 9
Functional Programming
Haskell Chapter 8.
Information and Computer Sciences University of Hawaii, Manoa
the IO Monad Kathleen Fisher cs242
Types CSCE 314 Spring 2016.
Chapter 7: Expressions and Assignment Statements
Theory of Computation Lecture 4: Programs and Computable Functions II
PROGRAMMING IN HASKELL
6.001 SICP Variations on a Scheme
CS 326 Programming Languages, Concepts and Implementation
CSEP505: Programming Languages Lecture 8: Haskell, Laziness, IO Monad
Representation, Syntax, Paradigms, Types
A lightening tour in 45 minutes
Chapter 7: Expressions and Assignment Statements
Haskell.
User-Defined Functions
I/O in C Lecture 6 Winter Quarter Engineering H192 Winter 2005
Haskell Dealing with impurity 30-Nov-18.
Haskell Dealing with impurity 30-Nov-18.
FP Foundations, Scheme In Text: Chapter 14.
Representation, Syntax, Paradigms, Types
Dynamic Scoping Lazy Evaluation
Representation, Syntax, Paradigms, Types
6.001 SICP Further Variations on a Scheme
Representation, Syntax, Paradigms, Types
Haskell Dealing with impurity 8-Apr-19.
PROGRAMMING IN HASKELL
Programming Languages
Haskell Dealing with impurity 29-May-19.
Haskell Dealing with impurity 28-Jun-19.
Advanced Functional Programming
Presentation transcript:

Kathleen Fisher Tufts University Thanks to Simon Peyton Jones for many of these slides. References: “Real World Haskell”, Chapter 0 & 7 ( “Tackling the Awkward Squad,” Sections 1-2Tackling the Awkward Squad

 Haskell is a programming language that is  Similar to ML: general-purpose, strongly typed, higher-order, functional, supports type inference, supports interactive and batch use  Different from ML: lazy evaluation, purely functional core, monadic IO, rapidly evolving type system.  Designed by committee in 80’s and 90’s to unify research efforts in lazy languages.  Haskell 1.0 in 1990, Haskell ‘98, Haskell’ ongoing. Haskell 1.0Haskell ‘98Haskell’  “A History of Haskell: Being Lazy with Class” HOPL 3A History of Haskell: Being Lazy with Class Paul Hudak John Hughes Simon Peyton Jones Phil Wadler

 Functional programming will make you think differently about programming.  Mainstream languages are all about state  Functional programming is all about values  Ideas will make you a better programmer in whatever language you regularly use.  Haskell is “cutting edge.” A lot of current research is done in the context of Haskell.  Lazy evaluation, managing side effects using monads, principled overloading using type classes

1yr5yr10yr15yr 1,000, ,000 The quick death Geeks Practitioners

1yr5yr10yr15yr 1,000, ,000 The slow death Geeks Practitioners

1yr5yr10yr15yr 1,000, ,000 The even slower death Geeks Practitioners

1yr5yr10yr15yr 1,000, ,000 The complete absence of death Geeks Practitioners Threshold of immortality

1,000, ,000 The second life? “Learning Haskell is a great way of training yourself to think functionally so you are ready to take full advantage of C# 3.0 when it comes out” (blog Apr 2007) “I'm already looking at coding problems and my mental perspective is now shifting back and forth between purely OO and more FP styled solutions” (blog Mar 2007) Geeks Practitioners

In Haskell, f :: A  B means for every x  A, In words, “if f x terminates, then f x  B.” In ML, functions with type A  B can throw an exception, but not in Haskell. Also, if e1 = e2, then f e1 = f e2. f x = some element y = f x  B run forever

 x :: Int means “x has type Int”  y : ys means “cons y onto list ys”  \x -> x + 1 “\” means lambda  Required capitalizations:  Expression identifiers are lowercase.  Type constructors (names) are uppercase.  Type variables are lower case.

 Notation for constructing new lists from old: myData = [1,2,3,4,5,6,7] twiceData = [2 * x | x <- myData] -- [2,4,6,8,10,12,14] twiceEvenData = [2 * x| x <- myData, x `mod` 2 == 0] -- [4,8,12]

 Haskell is a lazy language  Functions and data constructors don’t evaluate their arguments until they need them.  Programmers can write control-flow operators that have to be built-in in eager languages. cond :: Bool -> a -> a -> a cond True t e = t cond False t e = e (||) :: Bool -> Bool -> Bool True || x = True False || x = x Short- circuiting “or”

isSubString :: String -> String -> Bool x `isSubString` s = or [ x `isPrefixOf` t | t <- suffixes s ] suffixes:: String -> [String] -- All suffixes of s suffixes[] = [[]] suffixes(x:xs) = (x:xs) : suffixes xs or :: [Bool] -> Bool -- (or bs) returns True if any of the bs is True or [] = False or (b:bs) = b || or bs type String = [Char]

 Generate all solutions (an enormous tree)  Walk the tree to find the solution you want nextMove :: Board -> Move nextMove b = selectMove allMoves where allMoves = allMovesFrom b A gigantic (perhaps infinite) tree of possible moves

Purity makes the interface explicit.  Takes a list, and returns a list; that’s all.  Takes a list; may modify it; may modify other persistent state; may do I/O. reverse:: [w] -> [w]-- Haskell void reverse( list l )/* C */

Pure functions are easy to test.  In an imperative or OO language, you have to  set up the state of the object and the external state it reads or writes  make the call  inspect the state of the object and the external state  perhaps copy part of the object or global state, so that you can use it in the post condition prop_RevRev l = reverse(reverse l) == l

Pure functional programming is beautiful:  Concise and powerful abstractions  higher-order functions, algebraic data types, parametric polymorphism, principled overloading,...  Close correspondence with mathematics  Semantics of a code function is the math function  Equational reasoning: if x = y, then f x = f y  Independence of order-of-evaluation (Church-Rosser) e1 * e2 e1’ * e2e1 * e2’ result The compiler can choose the best order in which to do evaluation, including skipping a term if it is not needed.

 But to be useful as well as beautiful, a language must manage the “Awkward Squad”:  Input/Output  Imperative update  Error recovery (eg, timing out, catching divide by zero, etc.)  Foreign-language interfaces  Concurrency The whole point of a running a program is to affect the real world, an “update in place.”

 Do everything the “usual way”:  I/O via “functions” with side effects:  Imperative operations via assignable reference cells:  Error recovery via exceptions  Foreign language procedures mapped to “functions”  Concurrency via operating system threads  Ok if evaluation order is baked into the language. putchar ‘x’ + putchar ‘y’ z = ref 0; z := !z + 1; f(z); w = !z (* What is the value of w? *)

 Consider:  Output depends upon the evaluation order of ( + ).  Consider:  Output depends on how the consumer uses the list. If only used in length ls, nothing will be printed because length does not evaluate elements of list. In a lazy functional language, like Haskell, the order of evaluation is deliberately undefined, so the “direct approach” will not work. res = putchar ‘x’ + putchar ‘y’ ls = [putchar ‘x’, putchar ‘y’]

 Laziness and side effects are incompatible.  Side effects are important!  For a long time, this tension was embarrassing to the lazy functional programming community.  In early 90’s, a surprising solution (the monad) emerged from an unlikely source (category theory).  Haskell’s IO monad provides a way of tackling the awkward squad: I/O, imperative state, exceptions, foreign functions, & concurrency.

A functional program defines a pure function, with no side effects. The whole point of running a program is to have some side effect. Tension

 Streams  Program issues a stream of requests to OS, which responds with a stream of responses.  Continuations  User supplies continuations to I/O routines to specify how to process results.  World-Passing  The “World” is passed around and updated, like a normal data structure.  Not a serious contender because designers didn’t know how to guarantee single-threaded access to the world.  Stream and Continuation models were discovered to be inter-definable.  Haskell 1.0 Report adopted Stream model.

A value of type ( IO t ) is an “action.” When performed, it may do some input/output before delivering a result of type t.

type IO t = World -> (t, World) IO t result :: t

 “Actions” are sometimes called “computations.”  An action is a first-class value.  Evaluating an action has no effect; performing the action has the effect. A value of type ( IO t ) is an “action.” When performed, it may do some input/output before delivering a result of type t. type IO t = World -> (t, World)

getChar Char putChar () Char getChar :: IO Char putChar :: Char -> IO () main :: IO () main = putChar ‘x’ Main program is an action of type IO ()

putChar () getChar Char To read a character and then write it back out, we need to connect two actions. The “bind” combinator lets us make these connections.

 We have connected two actions to make a new, bigger action. putChar () Char getChar (>>=) :: IO a -> (a -> IO b) -> IO b echo :: IO () echo = getChar >>= putChar

 Operator is called bind because it binds the result of the left-hand action in the action on the right.  Performing compound action a >>= \x->b:  performs action a, to yield value r  applies function \x->b to r  performs the resulting action b{x <- r}  returns the resulting value v b v a x r

 The parentheses are optional because lambda abstractions extend “as far to the right as possible.”  The putChar function returns unit, so there is no interesting value to pass on. echoDup :: IO () echoDup = getChar >>= (\c -> putChar c >>= (\() -> putChar c ))

 The “then” combinator (>>) does sequencing when there is no value to pass: (>>) :: IO a -> IO b -> IO b m >> n = m >>= (\_ -> n) echoDup :: IO () echoDup = getChar >>= \c -> putChar c >> putChar c echoTwice :: IO () echoTwice = echo >> echo

 We want to return (c1,c2).  But, (c1,c2) :: (Char, Char)  And we need to return something of type IO(Char, Char)  We need to have some way to convert values of “plain” type into the I/O Monad. getTwoChars :: IO (Char,Char) getTwoChars = getChar>>= \c1 -> getChar>>= \c2 -> ????

 The action (return v) does no IO and immediately returns v: return :: a -> IO a return getTwoChars :: IO (Char,Char) getTwoChars = getChar>>= \c1 -> getChar>>= \c2 -> return (c1,c2)

 The “do” notation adds syntactic sugar to make monadic code easier to read.  Do syntax designed to look imperative. -- Do Notation getTwoCharsDo :: IO(Char,Char) getTwoCharsDo = do { c1 <- getChar ; c2 <- getChar ; return (c1,c2) } -- Plain Syntax getTwoChars :: IO (Char,Char) getTwoChars = getChar>>= \c1 -> getChar>>= \c2 -> return (c1,c2)

 The “do” notation only adds syntactic sugar: do { x >= \x -> do { es } do { e; es }=e >> do { es } do { e }=e do {let ds; es} = let ds in do {es} The scope of variables bound in a generator is the rest of the “do” expression. The last item in a “do” expression must be an expression.

 The following are equivalent: do { x1 <- p1;...; xn <- pn; q } do x1 <- p1... xn <- pn q do x1 <- p1;...; xn <- pn; q If the semicolons are omitted, then the generators must line up. The indentation replaces the punctuation.

 The getLine function reads a line of input: getLine :: IO [Char] getLine = do { c <- getChar ; if c == '\n' then return [] else do { cs <- getLine; return (c:cs) }} Note the “regular” code mixed with the monadic operations and the nested “do” expression.

 Each action in the IO monad is a possible stage in an assembly line.  For an action with type IO a, the type  tags the action as suitable for the IO assembly line via the IO type constructor.  indicates that the kind of thing being passed to the next stage in the assembly line has type a.  The bind operator “snaps” two stages s1 and s2 together to build a compound stage.  The return operator converts a pure value into a stage in the assembly line.  The assembly line does nothing until it is turned on.  The only safe way to “run” an IO assembly is to execute the program, either using ghci or running an executable. 12

 Running the program turns on the IO assembly line.  The assembly line gets “the world” as its input and delivers a result and a modified world.  The types guarantee that the world flows in a single thread through the assembly line. Result ghci or compiled program

 Values of type (IO t) are first class, so we can define our own control structures.  Example use: forever :: IO () -> IO () forever a = a >> forever a repeatN :: Int -> IO () -> IO () repeatN 0 a = return () repeatN n a = a >> repeatN (n-1) a Main> repeatN 5 (putChar 'h')

 Values of type (IO t) are first class, so we can define our own control structures.  Example use: for :: [a] -> (a -> IO b) -> IO () for [] fa = return () for (x:xs) fa = fa x >> for xs fa Main> for [1..10] (\x -> putStr (show x))

 Example use: sequence :: [IO a] -> IO [a] sequence [] = return [] sequence (a:as) = do { r <- a; rs <- sequence as; return (r:rs) } Main> sequence [getChar, getChar, getChar] A list of IO actions. An IO action returning a list.

Slogan: First-class actions let programmers write application-specific control structures.

 The IO Monad provides a large collection of operations for interacting with the “World.”  For example, it provides a direct analogy to the Standard C library functions for files: openFile :: FilePath -> IOMode -> IO Handle hPutStr :: Handle -> String -> IO () hGetLine :: Handle -> IO String hClose :: Handle -> IO ()

 The IO operations let us write programs that do I/O in a strictly sequential, imperative fashion.  Idea: We can leverage the sequential nature of the IO monad to do other imperative things!  A value of type IORef a is a reference to a mutable cell holding a value of type a. data IORef a -- Abstract type newIORef :: a -> IO (IORef a) readIORef :: IORef a -> IO a writeIORef :: IORef a -> a -> IO ()

But this is terrible! Contrast with: sum [1..n]. Claims to need side effects, but doesn’t really. import Data.IORef -- import reference functions -- Compute the sum of the first n integers count :: Int -> IO Int count n = do { r <- newIORef 0; addToN r 1 } where addToN :: IORef Int -> Int -> IO Int addToN r i | i > n = readIORef r | otherwise = do { v <- readIORef r ; writeIORef r (v + i) ; addToN r (i+1)}

import Data.IORef -- import reference functions -- Compute the sum of the first n integers count :: Int -> IO Int count n = do { r <- newIORef 0; addToN r 1 } where addToN :: IORef Int -> Int -> IO Int addToN r i | i > n = readIORef r | otherwise = do { v <- readIORef r ; writeIORef r (v + i) ; addToN r (i+1)} Just because you can write C code in Haskell, doesn’t mean you should!

 Track the number of chars written to a file.  Here it makes sense to use a reference. type HandleC = (Handle, IORef Int) openFileC :: FilePath -> IOMode -> IO HandleC openFileC file mode = do { h <- openFile file mode ; v <- newIORef 0 ; return (h,v) } hPutStrC :: HandleC -> String -> IO() hPutStrC (h,r) cs = do { v <- readIORef r ; writeIORef r (v + length cs) ; hPutStr h cs }

 All operations return an IO action, but only bind (>>=) takes one as an argument.  Bind is the only operation that combines IO actions, which forces sequentiality.  Within the program, there is no way out! return :: a -> IO a (>>=) :: IO a -> (a -> IO b) -> IO b getChar :: IO Char putChar :: Char -> IO ()... more operations on characters... openFile :: [Char] -> IOMode -> IO Handle... more operations on files... newIORef :: a -> IO (IORef a)... more operations on references...

 Suppose you wanted to read a configuration file at the beginning of your program:  The problem is that readFile returns an IO String, not a String.  Option 1: Write entire program in IO monad. But then we lose the simplicity of pure code.  Option 2: Escape from the IO Monad using a function from IO String -> String. But this is the very thing that is disallowed! configFileContents :: [String] configFileContents = lines (readFile "config") -- WRONG! useOptimisation :: Bool useOptimisation = "optimise" ‘elem‘ configFileContents

 Reading a file is an I/O action, so in general it matters when we read the file.  But we know the configuration file will not change during the program, so it doesn’t matter when we read it.  This situation arises sufficiently often that Haskell implementations offer one last unsafe I/O primitive: unsafePerformIO. unsafePerformIO :: IO a -> a configFileContents :: [String] configFileContents = lines(unsafePerformIO(readFile "config"))

 The operator has a deliberately long name to discourage its use.  Its use comes with a proof obligation: a promise to the compiler that the timing of this operation relative to all other operations doesn’t matter. unsafePerformIO :: IO a -> a Result act Invent World Discard World

 As its name suggests, unsafePerformIO breaks the soundness of the type system.  So claims that Haskell is type safe only apply to programs that don’t use unsafePerformIO.  Similar examples are what caused difficulties in integrating references with Hindley/Milner type inference in ML. r :: forall a. IORef a -- This is bad! r = unsafePerformIO (newIORef (error "urk")) cast :: b -> c cast x = unsafePerformIO (do {writeIORef r x; readIORef r })

 GHC uses world-passing semantics for the IO monad:  It represents the “world” by an un-forgeable token of type World, and implements bind and return as:  Using this form, the compiler can do its normal optimizations. The dependence on the world ensures the resulting code will still be single-threaded.  The code generator then converts the code to modify the world “in-place.” type IO t = World -> (t, World) return :: a -> IO a return a = \w -> (a,w) (>>=) :: IO a -> (a -> IO b) -> IO b (>>=) m k = \w -> case m w of (r,w’) -> k r w’

 What makes the IO Monad a Monad?  A monad consists of:  A type constructor M  A function bind :: M a -> ( a -> M b) -> M b  A function return :: a -> M a  Plus: Laws about how these operations interact.

return x >>= f = f x m >>= return = m do { x <- m 1 ; y <- m 2 ; m 3 } do { y <- do { x <- m 1 ; m 2 } m 3 } = x not in free vars of m 3

done >> m = m m >> done = m m 1 >> (m 2 >> m 3 ) = (m 1 >> m 2 ) >> m 3 (>>) :: IO a -> IO b -> IO b m >> n = m >>= (\_ -> n) done :: IO () done = return ()

 Using the monad laws and equational reasoning, we can prove program properties. putStr :: String -> IO () putStr [] = done putStr (c:s) = putChar c >> putStr s Proposition : putStr r >> putStr s = putStr (r ++ s)

putStr :: String -> IO () putStr [] = done putStr (c:cs) = putChar c >> putStr cs Proof: By induction on r. Base case: r is [] putStr [] >> putStr s = ( definition of putStr ) done >> putStr s = ( first monad law for >>) putStr s = (definition of ++) putStr ([] ++ s) Induction case: r is (c:cs) … Proposition : putStr r >> putStr s = putStr (r ++ s)

 A Haskell program is a single IO action called main. Inside IO, evaluation order is defined.  Big IO actions are built by gluing together smaller ones with bind ( >>= ) and by converting pure code into actions with return.  IO actions are first-class.  They can be passed to functions, returned from functions, and stored in data structures.  So it is easy to define new “glue” combinators.  The IO Monad allows Haskell to be pure while efficiently supporting side effects.  The type system separates the pure from the effectful code.

 In languages like ML or Java, the fact that the language is in the IO monad is baked in to the language. There is no need to mark anything in the type system because it is everywhere.  In Haskell, the programmer can choose when to live in the IO monad and when to live in the realm of pure functional programming.  So it is not Haskell that lacks imperative features, but rather the other languages that lack the ability to have a statically distinguishable pure subset.

 Download:  Interactive:  ghci HaskellIntro.hs  Compiled:  ghc –make HaskellIntro.hs