PROGRAMMING IN HASKELL Type declarations and Modules let y = Add (Val 1) (Mul (Val 2) (Val 3)) Based on lecture notes by Graham Hutton The book “Learn You a Haskell for Great Good” (and a few other sources)
Recap of Typeclasses We have seen typeclasses, which describe classes of data where operations of a certain type make sense. Look more closely at an example, Eq: class Eq a where (==) :: a -> a -> Bool (/=) :: a -> a -> Bool x == y = not (x /= y) x /= y = not (x == y) 1
data TrafficLight = Red | Yellow | Green Now – say we want to make a new type and make sure it belongs to a given typeclass. Here’s how: data TrafficLight = Red | Yellow | Green instance Eq TrafficLight where Red == Red = True Green == Green = True Yellow == Yellow = True _ == _ = False 2
instance Show TrafficLight where show Red = "Red light" Now maybe we want to be able to display these at the prompt. To do this, we need to add this to the “show” class. instance Show TrafficLight where show Red = "Red light" show Yellow = "Yellow light" show Green = "Green light" 3
And finally, we can use these things: ghci> Red == Red True ghci> Red == Yellow False ghci> Red `elem` [Red, Yellow, Green] ghci> [Red, Yellow, Green] [Red light,Yellow light,Green light] 4
Modules So far, we’ve been using built-in functions provided in the Haskell prelude. This is a subset of a larger library that is provided with any installation of Haskell. (Google for Hoogle to see a handy search engine for these.) Examples of other modules: - lists - concurrent programming - complex numbers - char - sets - …
This is a function in Data.List that removes duplicates from a list. Example: Data.List To load a module, we need to import it: import Data.List All the functions in this module are immediately available: numUniques :: (Eq a) => [a] -> Int numUniques = length . nub This is a function in Data.List that removes duplicates from a list. function concatenation
You can also load modules from the command prompt: ghci> :m + Data.List Or several at once: ghci> :m + Data.List Data.Map Data.Set Or import only some, or all but some: import Data.List (nub, sort) import Data.List hiding (nub)
If duplication of names is an issue, can extend the namespace: import qualified Data.Map This imports the functions, but we have to use Data.Map to use them – like Data.Map.filter. When the Data.Map gets a bit long, we can provide an alias: import qualified Data.Map as M And now we can just type M.filter, and the normal list filter will just be filter.
ghci> intersperse '.' "MONKEY" "M.O.N.K.E.Y" Data.List has a lot more functionality than we’ve seen. A few examples: ghci> intersperse '.' "MONKEY" "M.O.N.K.E.Y" ghci> intersperse 0 [1,2,3,4,5,6] [1,0,2,0,3,0,4,0,5,0,6] ghci> intercalate " " ["hey","there","guys"] "hey there guys" ghci> intercalate [0,0,0] [[1,2,3],[4,5,6], [7,8,9]] [1,2,3,0,0,0,4,5,6,0,0,0,7,8,9] 9
ghci> transpose [[1,2,3],[4,5,6], [7,8,9]] And even more: ghci> transpose [[1,2,3],[4,5,6], [7,8,9]] [[1,4,7],[2,5,8],[3,6,9]] ghci> transpose ["hey","there","guys"] ["htg","ehu","yey","rs","e"] ghci> concat ["foo","bar","car"] "foobarcar" ghci> concat [[3,4,5],[2,3,4],[2,1,1]] [3,4,5,2,3,4,2,1,1] 10
ghci> and $ map (>4) [5,6,7,8] True And even more: ghci> and $ map (>4) [5,6,7,8] True ghci> and $ map (==4) [4,4,4,3,4] False ghci> any (==4) [2,3,5,6,1,4] True ghci> all (>4) [6,9,10] True 11
A nice example: adding functions Functions are often represented as vectors: 8x^3 + 5x^2 + x - 1 is [8,5,1,-1]. So we can easily use List functions to add these vectors: ghci> map sum $ transpose [[0,3,5,9], [10,0,0,9],[8,5,1,-1]] [18,8,6,17] 12
There are a ton of these functions, so I could spend all semester covering just lists. More examples: group, sort, dropWhile, takeWhile, partition, isPrefixOf, find, findIndex, delete, words, insert,… Instead, I’ll make sure to post a link to a good overview of lists on the webpage, in case you need them. In essence, if it’s a useful thing to do to a list, Haskell probably supports it! 13
Examples: isAlpha, isLower, isSpace, isDigit, isPunctuation,… The Data.Char module: includes a lot of useful functions that will look similar to python, actually. Examples: isAlpha, isLower, isSpace, isDigit, isPunctuation,… ghci> all isAlphaNum "bobby283" True ghci> all isAlphaNum "eddy the fish!"False ghci> groupBy ((==) `on` isSpace) "hey guys its me" ["hey"," ","guys"," ","its"," ","me"] 14
The Data.Char module has a datatype that is a set of comparisons on characters. There is a function called generalCategory that returns the information. (This is a bit like the Ordering type for numbers, which returns LT, EQ, or GT.) ghci> generalCategory ' ' Space ghci> generalCategory 'A' UppercaseLetter ghci> generalCategory 'a' LowercaseLetter ghci> generalCategory '.' OtherPunctuation ghci> generalCategory '9' DecimalNumber ghci> map generalCategory " ¥t¥nA9?|" [Space,Control,Control,UppercaseLetter,DecimalNumber,OtherPunctuation,MathSymbol] ] 15
There are also functions that can convert between Ints and Chars: ghci> map digitToInt "FF85AB" [15,15,8,5,10,11] ghci> intToDigit 15 'f' ghci> intToDigit 5 '5' ghci> chr 97 'a' ghci> map ord "abcdefgh" [97,98,99,100,101,102,103,104] 16
Neat application: Ceasar ciphers A primitive encryption cipher which encodes messages by shifted them a fixed amount in the alphabet. Example: hello with shift of 3 encode :: Int -> String -> String encode shift msg = let ords = map ord msg shifted = map (+ shift) ords in map chr shifted 17
ghci> encode 3 "Heeeeey" "Khhhhh|" ghci> encode 4 "Heeeeey" Now to use it: ghci> encode 3 "Heeeeey" "Khhhhh|" ghci> encode 4 "Heeeeey" "Liiiii}" ghci> encode 1 "abcd" "bcde" ghci> encode 5 "Marry Christmas! Ho ho ho!” "Rfww~%Hmwnxyrfx&%Mt%mt%mt&" 18
Decoding just reverses the encoding: decode :: Int -> String -> String decode shift msg = encode (negate shift) msg ghci> encode 3 "Im a little teapot" "Lp#d#olwwoh#whdsrw" ghci> decode 3 "Lp#d#olwwoh#whdsrw" "Im a little teapot" ghci> decode 5 . encode 5 $ "This is a sentence" "This is a sentence" 19
Making our own modules We specify our own modules at the beginning of a file. For example, if we had a set of geometry functions: module Geometry ( sphereVolume , sphereArea , cubeVolume , cubeArea , cuboidArea , cuboidVolume ) where
Then, we put the functions that the module uses: sphereVolume :: Float -> Float sphereVolume radius = (4.0 / 3.0) * pi * (radius ^ 3) sphereArea :: Float -> Float sphereArea radius = 4 * pi * (radius ^ 2) cubeVolume :: Float -> Float cubeVolume side = cuboidVolume side side side … 21
Note that we can have “private” helper functions, also: cuboidVolume :: Float -> Float -> Float -> Float cuboidVolume a b c = rectangleArea a b * c cuboidArea :: Float -> Float -> Float -> Float cuboidArea a b c = rectangleArea a b * 2 + rectangleArea a c * 2 + rectangleArea c b * 2 rectangleArea :: Float -> Float -> Float rectangleArea a b = a * b 22
Each will hold a separate group of functions. To load: Can also nest these. Make a folder called Geometry, with 3 files inside it: Sphere.hs Cubiod.hs Cube.hs Each will hold a separate group of functions. To load: import Geometry.Sphere Or (if functions have same names): import qualified Geometry.Sphere as Sphere 23
module Geometry.Sphere ( volume , area ) where The modules: module Geometry.Sphere ( volume , area ) where volume :: Float -> Float volume radius = (4.0 / 3.0) * pi * (radius ^ 3) area :: Float -> Float area radius = 4 * pi * (radius ^ 2) 24
module Geometry.Cuboid ( volume , area ) where volume :: Float -> Float -> Float -> Float volume a b c = rectangleArea a b * c … 25
Putting it together: Arithmetic Expressions Consider a simple form of expressions built up from integers using addition and multiplication. 1 + 3 2
Using recursion, a suitable new type to represent such expressions can be declared by: data Expr = Val Int | Add Expr Expr | Mul Expr Expr For example, the expression on the previous slide would be represented as follows: Add (Val 1) (Mul (Val 2) (Val 3))
Using recursion, it is now easy to define functions that process expressions. For example: size :: Expr Int size (Val n) = 1 size (Add x y) = size x + size y size (Mul x y) = size x + size y eval :: Expr Int eval (Val n) = n eval (Add x y) = eval x + eval y eval (Mul x y) = eval x * eval y
The three constructors have types: Note: The three constructors have types: Val :: Int Expr Add :: Expr Expr Expr Mul :: Expr Expr Expr Many functions on expressions can be defined by replacing the constructors by other functions using a suitable fold function. For example: eval = fold id (+) (*)
Then edit in the other two, and find a way to test! Exercise: Edit our simple expressions to support subtraction and division in eval as well. Grab expression.hs from the schedule page for the code so far. After loading, you can use it: echambe5@hopper$ ghci expression.hs GHCi, version 7.6.3: (etc) Loading (etc) Ok, modules loaded: Expression. *Expression> let y = Add (Val 1) (Mul (Val 2) (Val 3)) *Expression> eval y 7 *Expression> size y 3 Then edit in the other two, and find a way to test!
File I/O So far, we’ve worked mainly at the prompt, and done very little true input or output. This is logical in a functional language, since nothing has side effects! However, this is a problem with I/O, since the whole point is to take input (and hence change some value) and then output something (which requires changing the state of the screen or other I/O device. Luckily, Haskell offers work-arounds that separate the more imperative I/O.
A simple example: save the following file as helloword.hs main = putStrLn "hello, world" Now we actually compile a program: $ ghc --make helloworld [1 of 1] Compiling Main ( helloworld.hs, helloworld.o ) Linking helloworld ... $ ./helloworld hello, world 32
What are these functions? ghci> :t putStrLn putStrLn :: String -> IO () ghci> :t putStrLn "hello, world" putStrLn "hello, world" :: IO () So putStrLn takes a string and returns an I/O action (which has a result type of (), the empty tuple). In Haskell, an I/O action is one with a side effect - usually either reading or printing. Usually some kind of a return value, where () is a dummy value for no return. 33
A more interesting example: An I/O action will only be performed when you give it the name “main” and then run the program. A more interesting example: main = do putStrLn "Hello, what's your name?” name <- getLine putStrLn ("Hey " ++ name ++ ", you rock!") Notice the do statement - more imperative style. Each step is an I/O action, and these glue together. 34
More on getLine: ghci> :t getLine getLine :: IO String This is the first I/O we’ve seen that doesn’t have an empty tuple type - it has a String. Once the string is returned, we use the <- to bind the result to the specified identifier. Notice this is the first non-functional action we’ve seen, since this function will NOT have the same value every time it is run! This is called “impure” code, and the value name is “tainted”. 35
nameTag = "Hello, my name is " ++ getLine An invalid example: nameTag = "Hello, my name is " ++ getLine What’s the problem? Well, ++ requires both parameters to have the same type. What is the return type of getLine? Another word of warning: what does the following do? name = getLine 36
ghci> putStrLn "HEEY" HEEY Just remember that I/O actions are only performed in a few possible places: A main function inside a bigger I/O block that we have composed with a do (and remember that the last action can’t be bound to a name, since that is the one that is the return type). At the ghci prompt: ghci> putStrLn "HEEY" HEEY 37
Note that <- is for I/O, and let for expressions. You can use let statements inside do blocks, to call other functions (and with no “in” part required): import Data.Char main = do putStrLn "What's your first name?" firstName <- getLine putStrLn "What's your last name?" lastName <- getLine let bigFirstName = map toUpper firstName bigLastName = map toUpper lastName putStrLn $ "hey " ++ bigFirstName ++ " " ++ bigLastName ++ ", how are you?" Note that <- is for I/O, and let for expressions. 38
What is return? Does NOT signal the end of execution! Return instead makes an I/O action out of a pure value. main = do a <- return "heck" b <- return "yeah!" putStrLn $ a ++ " " ++ b In essence, return is the opposite of <-. Instead of “unwrapping” I/O Strings, it wraps them. 39
Last example was a bit redundant, though – could use a let instead: main = do let a = ”heck" b = "yeah" putStrLn $ a ++ " " ++ b Usually, you’ll use return to create I/O actions that don’t do anything (but you have to have one anyway, like an if-then-else), or for the last line of a do block, so it returns some value we want. 40
Takeaway: Return in haskell is NOT like other languages. main = do line <- getLine if null line then return () else do putStrLn $ reverseWords line main reverseWords :: String -> String reverseWords = unwords . map reverse . words Note: reverseWords = unwords . map reverse . words is the same as reverseWords st = nwords (map reverse (words st)) 41
print (works on any type in show, but calls show first) Other I/O functions: print (works on any type in show, but calls show first) putStr - And as putStrLn, but no newline putChar and getChar main = do print True print 2 print "haha" print 3.2 print [3,4,3] main = do c <- getChar if c /= ' ' then do putChar c main else return () 42
More advanced functionality is available in Control.Monad: import Control.Monad import Data.Char main = forever $ do putStr "Give me some input: " l <- getLine putStrLn $ map toUpper l (Will indefinitely ask for input and print it back out capitalized.) 43
sequence: takes list of I/O actions and does them one after the other Other functions: sequence: takes list of I/O actions and does them one after the other mapM: takes a function (which returns an I/O) and maps it over a list Others available in Control.Monad: when: takes boolean and I/O action. If bool is true, returns same I/O, and if false, does a return instead 44
System Level programming An example: System Level programming Scripting functionality deals with I/O as a necessity. The module System.Environment has several to help with this: getArgs: returns a list of the arguments that the program was run with getProgName: returns the string which is the program name (Note: I’ll be assuming you compile using “ghc –make myprogram” and then running “./myprogram”. But you could also do “runhaskell myprogram.hs”.) 45
An example: import System.Environment import Data.List main = do args <- getArgs progName <- getProgName putStrLn "The arguments are:" mapM putStrLn args putStrLn "The program name is:" putStrLn progName 46
$ ./arg-test first second w00t "multi word arg" The arguments are: The output: $ ./arg-test first second w00t "multi word arg" The arguments are: first second w00t multi word arg The program name is: arg-test 47