This week it’s time to venture out into the wild. You’ll be using a Rust library to implement an arithmetic expression engine. Likely, a lot of your time will be spent on the parser, built using nom, a parser combinator library. This will also be the first assignment involving non-trivial project structure.

The starter code can be found here. It includes the ‘parser’ from nom’s documentation which ‘parses’ an arithmetic expression into a value (quite cleverly).

The project is organized as follows:

src/
  repl.rs        - A binary which runs a REPL
  expr/          - The expr module
    mod.rs       - The entry to the expr module: re-exports Expr and its parser
    expr.rs      - The Expr definition
    parser.rs    - The Expr parser
  rexpr.rs       - See the bonus

Your job this week is to:

Expand the ‘parser’ into an actual parser - one that produces an Expr rather than just its value.
Implement evaluate for Expr, and fix the REPL to use it. This is integer arithmetic.
Implement a few statistic-gathering functions for Expr: depth and operation_count.
Write a new binary, named stat, which prints the value, depth, and operation count of an expression (separated by spaces).
Fill out the post-assignment questionnaire!
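
To make the evaluation and statistics tasks concrete, here is a minimal sketch. The shape of Expr, the Op enum, and the bin and sample helpers are all assumptions made for illustration; the definition in src/expr/expr.rs may look different. The convention that a bare literal has depth 0 is also an assumption, so check it against examples/test.stat.out.

```rust
// Hypothetical shape for Expr; the starter's definition may differ.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Op {
    Add,
    Sub,
    Mul,
    Div,
}

#[derive(Debug, Clone, PartialEq)]
enum Expr {
    Literal(i64),
    BinOp(Box<Expr>, Op, Box<Expr>),
}

impl Expr {
    // Integer arithmetic; division truncates and panics on division by zero.
    fn evaluate(&self) -> i64 {
        match self {
            Expr::Literal(n) => *n,
            Expr::BinOp(l, op, r) => {
                let (l, r) = (l.evaluate(), r.evaluate());
                match op {
                    Op::Add => l + r,
                    Op::Sub => l - r,
                    Op::Mul => l * r,
                    Op::Div => l / r,
                }
            }
        }
    }

    // Assumed convention: a literal has depth 0, each operation adds one level.
    fn depth(&self) -> usize {
        match self {
            Expr::Literal(_) => 0,
            Expr::BinOp(l, _, r) => 1 + l.depth().max(r.depth()),
        }
    }

    // Every BinOp node counts as one operation.
    fn operation_count(&self) -> usize {
        match self {
            Expr::Literal(_) => 0,
            Expr::BinOp(l, _, r) => 1 + l.operation_count() + r.operation_count(),
        }
    }
}

// Hypothetical helper to keep tree construction readable.
fn bin(l: Expr, op: Op, r: Expr) -> Expr {
    Expr::BinOp(Box::new(l), op, Box::new(r))
}

// The tree a parser might build for "(8-2)/3+4*5".
fn sample() -> Expr {
    let left = bin(bin(Expr::Literal(8), Op::Sub, Expr::Literal(2)), Op::Div, Expr::Literal(3));
    let right = bin(Expr::Literal(4), Op::Mul, Expr::Literal(5));
    bin(left, Op::Add, right)
}

fn main() {
    let e = sample();
    // Same "value depth operation_count" layout the stat binary is asked to print.
    println!("{} {} {}", e.evaluate(), e.depth(), e.operation_count());
}
```

All three functions are the same structural recursion over the tree, which is why they fit naturally as methods on Expr.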

This assignment should be tested as follows:

diff <(cargo run --bin repl < examples/test.in) examples/test.repl.out
diff <(cargo run --bin stat < examples/test.in) examples/test.stat.out

Bonuses

They’re not necessarily in order of increasing difficulty.

Bonus A: Optimizing Arithmetic Expressions.

This bonus has a few memory issues upfront, plus an open-ended optimization problem!

The idea is to minimize the depth of our arithmetic expressions. This is a fun problem and also has some practical use: when building circuits, the depth of an expression roughly corresponds to the time required to compute it. The starting parser (and likely the one you build) is strictly left-associative, which is what division and subtraction require. However, addition and multiplication are associative operations, and we’d like to take advantage of that.
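
To see how much depth associativity can save, the sketch below compares two shapes for the same sum. The Sum type and the left_nested and balanced helpers are toy names invented for this illustration and are not part of the starter code.

```rust
// Toy tree with only addition, enough to compare shapes.
enum Sum {
    Lit(i64),
    Add(Box<Sum>, Box<Sum>),
}

impl Sum {
    fn value(&self) -> i64 {
        match self {
            Sum::Lit(n) => *n,
            Sum::Add(l, r) => l.value() + r.value(),
        }
    }

    // Assumed convention: a literal has depth 0.
    fn depth(&self) -> usize {
        match self {
            Sum::Lit(_) => 0,
            Sum::Add(l, r) => 1 + l.depth().max(r.depth()),
        }
    }
}

// The shape a strictly left-associative parser produces: ((a + b) + c) + d ...
fn left_nested(xs: &[i64]) -> Sum {
    let mut iter = xs.iter();
    let mut acc = Sum::Lit(*iter.next().expect("need at least one literal"));
    for &x in iter {
        acc = Sum::Add(Box::new(acc), Box::new(Sum::Lit(x)));
    }
    acc
}

// The same operands in the same order, split in half recursively.
fn balanced(xs: &[i64]) -> Sum {
    match xs {
        [] => panic!("need at least one literal"),
        [x] => Sum::Lit(*x),
        _ => {
            let (l, r) = xs.split_at(xs.len() / 2);
            Sum::Add(Box::new(balanced(l)), Box::new(balanced(r)))
        }
    }
}

fn main() {
    let xs = [1, 2, 3, 4, 5, 6, 7, 8];
    let (chain, tree) = (left_nested(&xs), balanced(&xs));
    println!("left-nested: value {} depth {}", chain.value(), chain.depth());
    println!("balanced:    value {} depth {}", tree.value(), tree.depth());
}
```

For n operands the left-nested chain has depth n - 1, while the balanced tree has depth about log2(n), and both evaluate to the same value because only the grouping changed.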

This bonus proceeds in the following steps:

Implement a conversion from Expr to RExpr (reduced expression), in which subtraction is replaced with addition and negation. By doing so, you expose more associativity to your optimizations.
Implement evaluation and the statistic functions for RExpr (see src/reduced_expr/rexpr.rs). Make a corresponding binary, rstat, which implements the same interface as stat. Test this against examples/test.rstat.out.
Implement tree rotations for RExpr. These rotations should perform the rotation if it is structurally possible and would not change the value of the expression, and return whether or not the rotation was done. In essence, these rotations express the structural freedom given by associativity.
Use rotations (or some other tools) to write minimize_depth for RExpr. You may also take advantage of other properties of addition/multiplication (e.g., commutativity) if you want, but be sure to document this.
- This part is entirely open-ended – have fun!

Use your optimization to write a binary named reduce which parses input arithmetic expressions as Exprs, prints their depth, converts them to RExpr, prints the new depth, minimizes the depth, and prints the final depth. All these depths should be space-separated.
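
As a rough sketch of the rotation step, here is one possible right rotation on addition nodes. The RExpr variants shown are an assumption (the real type in rexpr.rs will also need multiplication and division), Neg is assumed to count as one level of depth, and the add, neg, and sample helpers are hypothetical.

```rust
// Hypothetical, trimmed-down RExpr: subtraction a - b is stored as Add(a, Neg(b)).
#[derive(Debug, Clone, PartialEq)]
enum RExpr {
    Lit(i64),
    Neg(Box<RExpr>),
    Add(Box<RExpr>, Box<RExpr>),
}

impl RExpr {
    fn evaluate(&self) -> i64 {
        match self {
            RExpr::Lit(n) => *n,
            RExpr::Neg(e) => -e.evaluate(),
            RExpr::Add(l, r) => l.evaluate() + r.evaluate(),
        }
    }

    // Assumed convention: literals have depth 0, Neg and Add each add a level.
    fn depth(&self) -> usize {
        match self {
            RExpr::Lit(_) => 0,
            RExpr::Neg(e) => 1 + e.depth(),
            RExpr::Add(l, r) => 1 + l.depth().max(r.depth()),
        }
    }

    // (a + b) + c  becomes  a + (b + c). Returns whether the rotation was done.
    fn rotate_right(&mut self) -> bool {
        // Move the tree out of `self` so its boxes can be taken apart by value;
        // the placeholder literal is always overwritten before returning.
        match std::mem::replace(self, RExpr::Lit(0)) {
            RExpr::Add(left, c) => match *left {
                RExpr::Add(a, b) => {
                    *self = RExpr::Add(a, Box::new(RExpr::Add(b, c)));
                    true
                }
                other => {
                    // Left child is not an addition: put everything back.
                    *self = RExpr::Add(Box::new(other), c);
                    false
                }
            },
            other => {
                *self = other;
                false
            }
        }
    }
}

fn add(l: RExpr, r: RExpr) -> RExpr {
    RExpr::Add(Box::new(l), Box::new(r))
}

fn neg(n: i64) -> RExpr {
    RExpr::Neg(Box::new(RExpr::Lit(n)))
}

// ((1 - 2) - 3) - 4 after replacing subtraction with addition and negation.
fn sample() -> RExpr {
    add(add(add(RExpr::Lit(1), neg(2)), neg(3)), neg(4))
}

fn main() {
    let mut e = sample();
    let (value, before) = (e.evaluate(), e.depth());
    let rotated = e.rotate_right();
    println!("rotated: {}, same value: {}", rotated, value == e.evaluate());
    println!("depth before: {}, after: {}", before, e.depth());
}
```

The rotation is only legal here because every node involved is an addition; the same pattern would apply to a pair of multiplications, but not across mixed operators or through division.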

Bonus B: Whitespace Insensitivity.

Change your parser so that it is no longer whitespace sensitive! There are a few ways to go about this, so I’ll leave it up to you as long as you include a discussion of the methods you considered and their properties.
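
One of the possible methods is to wrap every token-level parser so that it skips surrounding whitespace. The sketch below shows that idea without any dependencies; PResult, digits, plus, ws, and sum are names invented for this illustration, not nom APIs. In nom, a similar wrapper can be assembled from combinators such as delimited and multispace0, though the exact signatures depend on the nom version you are using.

```rust
// A parser takes input and returns the remaining input plus a value, or None.
type PResult<'a, T> = Option<(&'a str, T)>;

// Parses one or more ASCII digits. Whitespace sensitive on its own.
fn digits(input: &str) -> PResult<'_, i64> {
    let end = input
        .find(|c: char| !c.is_ascii_digit())
        .unwrap_or(input.len());
    if end == 0 {
        return None;
    }
    let value = input[..end].parse::<i64>().ok()?;
    Some((&input[end..], value))
}

// Parses a single '+'. Whitespace sensitive on its own.
fn plus(input: &str) -> PResult<'_, char> {
    input.strip_prefix('+').map(|rest| (rest, '+'))
}

// The wrapper: skip whitespace before and after whatever `parser` accepts.
fn ws<'a, T>(parser: impl Fn(&'a str) -> PResult<'a, T>, input: &'a str) -> PResult<'a, T> {
    let (rest, value) = parser(input.trim_start())?;
    Some((rest.trim_start(), value))
}

// number ('+' number)*, with every token wrapped in `ws`.
fn sum(input: &str) -> PResult<'_, i64> {
    let (mut rest, mut total) = ws(digits, input)?;
    while let Some((after_plus, _)) = ws(plus, rest) {
        let (after_term, term) = ws(digits, after_plus)?;
        total += term;
        rest = after_term;
    }
    Some((rest, total))
}

fn main() {
    println!("{:?}", sum("1+2+3"));
    println!("{:?}", sum("  1 +  2\t+3 "));
}
```

An alternative worth discussing is a separate tokenizing pass that discards whitespace before parsing; it keeps the grammar rules clean but adds a second stage and an intermediate token type.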

Bonus C: Lifetime Preview.

This bonus is weakly motivated, but deals more directly with memory management issues.

The goal for this bonus is to implement yet a third version of Expr, let’s call it SliceExpr, which, instead of storing the integer values of literals in the expression, stores slices of the input: the slices where those literals are found. You should also implement evaluate for this new data structure.
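
A minimal sketch of what such a type could look like is below. Only the name SliceExpr comes from the assignment; its variants, the choice of &str for the slices, and the two_term_sum helper (which stands in for a real parser) are assumptions made for illustration.

```rust
use std::mem::size_of;
use std::num::ParseIntError;

// Hypothetical SliceExpr: literals borrow from the input text instead of
// storing an i64. The lifetime 'a ties the tree to the input it points into.
#[derive(Debug)]
enum SliceExpr<'a> {
    Literal(&'a str),
    Add(Box<SliceExpr<'a>>, Box<SliceExpr<'a>>),
}

impl SliceExpr<'_> {
    // Literals are only converted to integers at evaluation time,
    // so evaluation can fail if a slice is not a valid number.
    fn evaluate(&self) -> Result<i64, ParseIntError> {
        match self {
            SliceExpr::Literal(text) => text.parse::<i64>(),
            SliceExpr::Add(l, r) => Ok(l.evaluate()? + r.evaluate()?),
        }
    }
}

// Stand-in for a real parser: splits "a+b" at the first '+' and keeps
// both sides as slices of `input`.
fn two_term_sum(input: &str) -> Option<SliceExpr<'_>> {
    let (l, r) = input.split_once('+')?;
    Some(SliceExpr::Add(
        Box::new(SliceExpr::Literal(l)),
        Box::new(SliceExpr::Literal(r)),
    ))
}

fn main() {
    let input = String::from("12+30");
    let expr = two_term_sum(&input).expect("input contains '+'");
    println!("{:?}", expr.evaluate());
    // `input` must outlive `expr`: dropping it here would not compile,
    // because `expr` still borrows from it.
    println!(
        "slice: {} bytes, i64: {} bytes",
        size_of::<&str>(),
        size_of::<i64>()
    );
}
```

The size comparison at the end makes the trade-off visible: a string slice is a pointer plus a length, so it takes two machine words.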

While this approach is useful in some cases (nom uses techniques like this under the hood), it’s pretty silly here because integer literals are actually smaller than slices :laughing:.

This bonus uses things we haven’t discussed yet, so I don’t really recommend it.

Bonus D: Blast From the Past

Last week a handful of people implemented remove_first using pop to expose the item to remove, and then push to put everything else back.

The sample solution I wrote does a traversal and reassembly of sorts, but is recursive, which raises concerns about overflowing the stack (recall Bonus C and Adam’s presentation).

It is also possible to write a non-recursive traversal and re-assembly. Since no one got this working last week and the sample solution doesn’t do it either, you can take another crack at it this week!

You may find this comment helpful if you choose to do so:

I believe that you can make this “traversal style” work, but you have to be very careful about who owns which node when (what the data structure and stack variables own during each step). Pictures are helpful for this.

Essentially you want to take ownership of the rest of the list at the beginning and walk through it, adding one node at a time to the back of the list. Likely the only &mut you should maintain is to the back of the list being reassembled.
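
A sketch of that ownership pattern is below. The Node and List definitions, the i32 element type, the bool return value, and the from_slice and to_vec helpers are all assumptions; last week's list type and the exact signature of remove_first may differ.

```rust
// Hypothetical singly linked list; the week 1 definition may differ.
struct Node {
    value: i32,
    next: Option<Box<Node>>,
}

struct List {
    head: Option<Box<Node>>,
}

impl List {
    fn from_slice(values: &[i32]) -> List {
        let mut head = None;
        for &value in values.iter().rev() {
            head = Some(Box::new(Node { value, next: head }));
        }
        List { head }
    }

    fn to_vec(&self) -> Vec<i32> {
        let mut out = Vec::new();
        let mut cur = &self.head;
        while let Some(node) = cur {
            out.push(node.value);
            cur = &node.next;
        }
        out
    }

    // Removes the first node equal to `target` without recursion.
    // Returns whether a node was removed.
    fn remove_first(&mut self, target: i32) -> bool {
        // Take ownership of the whole chain; `self.head` is now empty.
        let mut rest = self.head.take();
        // The only &mut we keep: the empty slot at the back of the rebuilt list.
        let mut back = &mut self.head;
        while let Some(mut node) = rest {
            // Detach `node` from what follows it; we now own both pieces.
            rest = node.next.take();
            if node.value == target {
                // Skip `node` (it is dropped) and reattach the remainder in one move.
                *back = rest;
                return true;
            }
            // Append `node` to the rebuilt list and advance `back` to its next slot.
            *back = Some(node);
            back = &mut back.as_mut().unwrap().next;
        }
        false
    }
}

fn main() {
    let mut list = List::from_slice(&[1, 2, 3, 2]);
    let removed = list.remove_first(2);
    println!("{} {:?}", removed, list.to_vec());
}
```

At every step each node is owned by exactly one place: the unprocessed chain in rest, the local node, or the rebuilt list reachable from self.head. Drawing those three owners for a short list is a good way to check the loop.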

As is often the case, the solution is elegant and easy to write if you’re thinking about it clearly. If you do this bonus, submit a PR against wk1-starter, and tell me you did so in your PR against wk2-starter.