Haskell is an advanced purely-functional programming language.
The latest standard of Haskell is Haskell 2010. As of May 2016, a group is working on the next version, Haskell 2020.
The official Haskell documentation is also a comprehensive and useful resource. Great place to find books, courses, tutorials, manuals, guides, etc.
Version | Release Date |
---|---|
Haskell 2010 | 2012-07-10 |
Haskell 98 | 2002-12-01 |
The easiest way to get started writing Haskell is probably by going to the Haskell website or Try Haskell and use the online REPL (read-eval-print-loop) on the home page. The online REPL supports most basic functionality and even some IO. There is also a basic tutorial available which can be started by typing the command help
. An ideal tool to start learning the basics of Haskell and try out some stuff.
For programmers that are ready to engage a little bit more, there is GHCi, an interactive environment that comes with the Glorious/Glasgow Haskell Compiler. The GHC can be installed separately, but that is only a compiler. In order to be able to install new libraries, tools like Cabal and Stack must be installed as well. If you are running a Unix-like operating system, the easiest installation is to install Stack using:
curl -sSL https://get.haskellstack.org/ | sh
This installs GHC isolated from the rest of your system, so it is easy to remove. All commands must be preceded by stack
though. Another simple approach is to install a Haskell Platform. The platform exists in two flavours:
These platforms can be installed by downloading an installer and following the instructions or by using your distribution's package manager (note that this version is not guaranteed to be up-to-date):
Ubuntu, Debian, Mint:
sudo apt-get install haskell-platform
Fedora:
sudo dnf install haskell-platform
Redhat:
sudo yum install haskell-platform
Arch Linux:
sudo pacman -S ghc cabal-install haskell-haddock-api \
haskell-haddock-library happy alex
Gentoo:
sudo layman -a haskell
sudo emerge haskell-platform
OSX with Homebrew:
brew cask install haskell-platform
OSX with MacPorts:
sudo port install haskell-platform
Once installed, it should be possible to start GHCi by invoking the ghci
command anywhere in the terminal. If the installation went well, the console should look something like
me@notebook:~$ ghci
GHCi, version 6.12.1: http://www.haskell.org/ghc/ :? for help
Prelude>
possibly with some more information on what libraries have been loaded before the Prelude>
. Now, the console has become a Haskell REPL and you can execute Haskell code as with the online REPL. In order to quit this interactive environment, one can type :q
or :quit
. For more information on what commands are available in GHCi, type :?
as indicated in the starting screen.
Because writing the same things again and again on a single line is not always that practically, it might be a good idea to write the Haskell code in files. These files normally have .hs
for an extension and can be loaded into the REPL by using :l
or :load
.
As mentioned earlier, GHCi is a part of the GHC, which is actually a compiler. This compiler can be used to transform a .hs
file with Haskell code into a running program. Because a .hs
file can contain a lot of functions, a main
function must be defined in the file. This will be the starting point for the program. The file test.hs
can be compiled with the command
ghc test.hs
this will create object files and an executable if there were no errors and the main
function was defined correctly.
It has already been mentioned earlier as package manager, but stack can be a useful tool for Haskell development in completely different ways. Once installed, it is capable of
IHaskell is a haskell kernel for IPython and allows to combine (runnable) code with markdown and mathematical notation.
We can declare a series of expressions in the REPL like this:
Prelude> let x = 5
Prelude> let y = 2 * 5 + x
Prelude> let result = y * 10
Prelude> x
5
Prelude> y
15
Prelude> result
150
To declare the same values in a file we write the following:
-- demo.hs
module Demo where
-- We declare the name of our module so
-- it can be imported by name in a project.
x = 5
y = 2 * 5 + x
result = y * 10
Module names are capitalized, unlike variable names.
The factorial function is a Haskell "Hello World!" (and for functional programming generally) in the sense that it succinctly demonstrates basic principles of the language.
fac :: (Integral a) => a -> a
fac n = product [1..n]
Integral
is the class of integral number types. Examples include Int
and Integer
.(Integral a) =>
places a constraint on the type a
to be in said classfac :: a -> a
says that fac
is a function that takes an a
and returns an a
product
is a function that accumulates all numbers in a list by multiplying them together.[1..n]
is special notation which desugars to enumFromTo 1 n
, and is the range of numbers 1 ≤ x ≤ n
.fac :: (Integral a) => a -> a
fac 0 = 1
fac n = n * fac (n - 1)
This variation uses pattern matching to split the function definition into separate cases. The first definition is invoked if the argument is 0
(sometimes called the stop condition) and the second definition otherwise (the order of definitions is significant). It also exemplifies recursion as fac
refers to itself.
It is worth noting that, due to rewrite rules, both versions of fac
will compile to identical machine code when using GHC with optimizations activated. So, in terms of efficiency, the two would be equivalent.
Lazy evaluation means Haskell will evaluate only list items whose values are needed.
The basic recursive definition is:
f (0) <- 0
f (1) <- 1
f (n) <- f (n-1) + f (n-2)
If evaluated directly, it will be very slow. But, imagine we have a list that records all the results,
fibs !! n <- f (n)
Then
┌──────┐ ┌──────┐ ┌──────┐
│ f(0) │ │ f(1) │ │ f(2) │
fibs -> 0 : 1 : │ + │ : │ + │ : │ + │ : .....
│ f(1) │ │ f(2) │ │ f(3) │
└──────┘ └──────┘ └──────┘
┌────────────────────────────────────────┐
│ f(0) : f(1) : f(2) : ..... │
└────────────────────────────────────────┘
-> 0 : 1 : +
┌────────────────────────────────────────┐
│ f(1) : f(2) : f(3) : ..... │
└────────────────────────────────────────┘
This is coded as:
fibn n = fibs !! n
where
fibs = 0 : 1 : map f [2..]
f n = fibs !! (n-1) + fibs !! (n-2)
Or even as
GHCi> let fibs = 0 : 1 : zipWith (+) fibs (tail fibs)
GHCi> take 10 fibs
[0, 1, 1, 2, 3, 5, 8, 13, 21, 34]
zipWith
makes a list by applying a given binary function to corresponding elements of the two lists given to it, so zipWith (+) [x1, x2, ...] [y1, y2, ...]
is equal to [x1 + y1, x2 + y2, ...]
.
Another way of writing fibs
is with the scanl
function:
GHCi> let fibs = 0 : scanl (+) 1 fibs
GHCi> take 10 fibs
[0, 1, 1, 2, 3, 5, 8, 13, 21, 34]
scanl
builds the list of partial results that foldl
would produce, working from left to right along the input list. That is, scanl f z0 [x1, x2, ...]
is equal to [z0, z1, z2, ...] where z1 = f z0 x1; z2 = f z1 x2; ...
.
Thanks to lazy evaluation, both functions define infinite lists without computing them out entirely. That is, we can write a fib
function, retrieving the nth element of the unbounded Fibonacci sequence:
GHCi> let fib n = fibs !! n -- (!!) being the list subscript operator
-- or in point-free style:
GHCi> let fib = (fibs !!)
GHCi> fib 9
34
A basic "Hello, World!" program in Haskell can be expressed concisely in just one or two lines:
main :: IO ()
main = putStrLn "Hello, World!"
The first line is an optional type annotation, indicating that main
is a value of type IO ()
, representing an I/O action which "computes" a value of type ()
(read "unit"; the empty tuple conveying no information) besides performing some side effects on the outside world (here, printing a string at the terminal). This type annotation is usually omitted for main
because it is its only possible type.
Put this into a helloworld.hs
file and compile it using a Haskell compiler, such as GHC:
ghc helloworld.hs
Executing the compiled file will result in the output "Hello, World!"
being printed to the screen:
./helloworld
Hello, World!
Alternatively, runhaskell
or runghc
make it possible to run the program in interpreted mode without having to compile it:
runhaskell helloworld.hs
The interactive REPL can also be used instead of compiling. It comes shipped with most Haskell environments, such as ghci
which comes with the GHC compiler:
ghci> putStrLn "Hello World!"
Hello, World!
ghci>
Alternatively, load scripts into ghci from a file using load
(or :l
):
ghci> :load helloworld
:reload
(or :r
) reloads everything in ghci:
Prelude> :l helloworld.hs
[1 of 1] Compiling Main ( helloworld.hs, interpreted )
<some time later after some edits>
*Main> :r
Ok, modules loaded: Main.
This first line is a type signature, declaring the type of main
:
main :: IO ()
Values of type IO ()
describe actions which can interact with the outside world.
Because Haskell has a fully-fledged Hindley-Milner type system which allows for automatic type inference, type signatures are technically optional: if you simply omit the main :: IO ()
, the compiler will be able to infer the type on its own by analyzing the definition of main
. However, it is very much considered bad style not to write type signatures for top-level definitions. The reasons include:
Type signatures in Haskell are a very helpful piece of documentation because the type system is so expressive that you often can see what sort of thing a function is good for simply by looking at its type. This “documentation” can be conveniently accessed with tools like GHCi. And unlike normal documentation, the compiler's type checker will make sure it actually matches the function definition!
Type signatures keep bugs local. If you make a mistake in a definition without providing its type signature, the compiler may not immediately report an error but instead simply infer a nonsensical type for it, with which it actually typechecks. You may then get a cryptic error message when using that value. With a signature, the compiler is very good at spotting bugs right where they happen.
This second line does the actual work:
main = putStrLn "Hello, World!"
If you come from an imperative language, it may be helpful to note that this definition can also be written as:
main = do {
putStrLn "Hello, World!" ;
return ()
}
Or equivalently (Haskell has layout-based parsing; but beware mixing tabs and spaces inconsistently which will confuse this mechanism):
main = do
putStrLn "Hello, World!"
return ()
Each line in a do
block represents some monadic (here, I/O) computation, so that the whole do
block represents the overall action comprised of these sub-steps by combining them in a manner specific to the given monad (for I/O this means just executing them one after another).
The do
syntax is itself a syntactic sugar for monads, like IO
here, and return
is a no-op action producing its argument without performing any side effects or additional computations which might be part of a particular monad definition.
The above is the same as defining main = putStrLn "Hello, World!"
, because the value putStrLn "Hello, World!"
already has the type IO ()
. Viewed as a “statement”, putStrLn "Hello, World!"
can be seen as a complete program, and you simply define main
to refer to this program.
You can look up the signature of putStrLn
online:
putStrLn :: String -> IO ()
-- thus,
putStrLn (v :: String) :: IO ()
putStrLn
is a function that takes a string as its argument and outputs an I/O-action (i.e. a value representing a program that the runtime can execute). The runtime always executes the action named main
, so we simply need to define it as equal to putStrLn "Hello, World!"
.
A few most salient variants:
import Data.List ( (\\) )
ps100 = ((([2..100] \\ [4,6..100]) \\ [6,9..100]) \\ [10,15..100]) \\ [14,21..100]
-- = (((2:[3,5..100]) \\ [9,15..100]) \\ [25,35..100]) \\ [49,63..100]
-- = (2:[3,5..100]) \\ ([9,15..100] ++ [25,35..100] ++ [49,63..100])
Sieve of Eratosthenes, using data-ordlist package:
import qualified Data.List.Ordered
ps = 2 : _Y ((3:) . minus [5,7..] . unionAll . map (\p -> [p*p, p*p+2*p..]))
_Y g = g (_Y g) -- = g (g (_Y g)) = g (g (g (g (...)))) = g . g . g . g . ...
(a sub-optimal trial division sieve)
ps = sieve [2..]
where
sieve (x:xs) = [x] ++ sieve [y | y <- xs, rem y x > 0]
-- = map head ( iterate (\(x:xs) -> filter ((> 0).(`rem` x)) xs) [2..] )
ps = 2 : [n | n <- [3..], all ((> 0).rem n) $ takeWhile ((<= n).(^2)) ps]
-- = 2 : [n | n <- [3..], foldr (\p r-> p*p > n || (rem n p > 0 && r)) True ps]
From trial division to sieve of Eratosthenes:
[n | n <- [2..], []==[i | i <- [2..n-1], j <- [0,i..n], j==n]]
nubBy (((>1).).gcd) [2..] -- i.e., nubBy (\a b -> gcd a b > 1) [2..]
nubBy
is also from Data.List
, like (\\)
.