

## **Implicit State Machines**

## Fengyun Liu Oracle Labs Switzerland fengyun.liu@oracle.com

## Aleksandar Prokopec Oracle Labs Switzerland aleksandar.prokopec@oracle.com

Abstract

Finite-state machines (FSM) are a simple yet powerful abstraction widely used for modeling, programming and verifying real-time and reactive systems that control modern factories, power plants, transportation systems and medical equipment.

However, traditionally finite-state machines are either encoded indirectly in an *imperative* language, such as C and Verilog, or embedded as an *imperative* extension of a declarative language, such as Lustre. Given the widely accepted advantage of declarative programming, can we have a declarative design of finite-state machines to facilitate design, construction, and verification of embedded programs?

By sticking to the design principle of declarativeness, we show that a novel abstraction emerges, *implicit state machines*, which is declarative in nature and at the same time supports recursive composition. Given its simplicity and universality, we believe it may serve as a new foundation for programming embedded systems.

# CCS Concepts: • Hardware $\rightarrow$ Software tools for EDA; • Software and its engineering $\rightarrow$ Domain specific languages.

*Keywords:* Finite-state machines, hierarchical finite-state machines, implicit state machines

## ACM Reference Format:

Fengyun Liu and Aleksandar Prokopec. 2022. Implicit State Machines. In Proceedings of the 23rd ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES '22), June 14, 2022, San Diego, CA, USA. ACM, New York, NY, USA, 13 pages. https://doi.org/10.1145/3519941.3535065

## 1 Introduction

Finite-state machines are a universal formalism for modeling, programming and verifying real-time and reactive systems,

*LCTES '22, June 14, 2022, San Diego, CA, USA* © 2022 Association for Computing Machinery. ACM ISBN 978-1-4503-9266-2/22/06...\$15.00

https://doi.org/10.1145/3519941.3535065

and are widely used in industiral automation, public transportation systems, medical equipment, as well as avionics.

While many imperative languages, such as C and Verilog, support finite-state machines via encoding, it is more advantageous to have finite-state machines as a language construct to facilitate design, construction, and verification of embedded systems.

However, most such programming models of finite-state machines are in an *imperative* style, instead of being *declara-tive* [14, 27]. It is well known that declarative programming bridges the gap of specification and implementation, facilitates program transformation and verification, and at the same time less error-prone than imperative programming [15, 18, 20, 22, 26, 33]. We therefore ask the following question:

## Can we have a declarative design of finitestate machines to facilitate design, construction, and verification of embedded systems?

The main idea of this paper is to show that by adhering to the design principle of declarativeness, we discover a novel abstraction, which we call *implicit state machines*, and which answers the question above affirmatively.

At the high-level, implicit state machines is based on the following observations about FSMs: (1) state transitions do not have to be explicitly enumerated state by state; (2) state transition is a function; and (3) the inputs of the state machines do not need to be declared explicitly (Section 2).

Our contributions are listed below:

- Following the design principle of declarativeness, we discover a novel abstraction, *implicit state machines*, which are simple, flexible, universal and recursively composable.
- We formalize the concept of implicit state machines in a core calculus specialized with the domain of Boolean algebra. We show that it can serve as a simple and elegant model for sequential digital circuits, which is not known previously.
- We develop an embedded DSL in Scala for digital design based on implicit state machines, and we assess its practicality by designing a micro-controller in the DSL.

## 2 Deriving Implicit State Machines

Readers may skip this section and jump to the intuitive introduction of implicit state machines on the first read (Section 3).

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Mathematically, a finite state machine is usually represented as a quintuple  $(I, S, s_0, \sigma, O)^{-1}$ :

- *I* is the set of inputs;
- *S* is the set of states;
- $s_0 \in S$  is the initial state;
- σ : I×S → S×O maps the input and the current state to the next state and the output;
- *O* is the set of outputs.

FSM can also be represented graphically by *state-transition diagrams*, as the following figure shows:



In the state machine above,  $q_1$  is the initial state, and each edge denotes a state transition: the label 0/1 on the edge means the transition happens when the input is 0, and it outputs 1 when the transition occurs.

*Implicit state machines* are based on a reflection on the essence of FSMs: a mapping from input and state to the next state and output.

**Insight 1**. The first insight towards implicit state machines is that *state transitions do not need to be explicitly enumerated*, as it is taken for granted in existing languages for programming with FSMs [12, 14, 21, 27].

In a declarative language, the mapping can be represented by any expression. This gives us a tentative representation as follows:

$$\lambda x: I \times S. (t_1, t_2)$$
 :  $I \times S \to S \times O$ 

The body  $(t_1, t_2)$  enforces that the output and next state are implemented as two functions. This imposes unnecessary syntatic constraints. If we introduce tuples in the language, we can replace  $(t_1, t_2)$  just by t:

$$\lambda x: I \times S. t \qquad : \qquad I \times S \to S \times O$$

**Insight 2**. The second insight is that *the state is neither an input to an FSM nor an output of an FSM, but an internal value.* It leads us to the following representation with the state variable *s*:

$$\lambda x: I. fsm \{ s \Longrightarrow t \} : I \to O$$

In the above, the term *t* still has the type  $S \times O$ . But seen from outside, a state machine just maps input to output, which corresponds to our intuition.

**Insight 3**. The last insight is that *the inputs do not need to be declared explicitly*, they can be *captured* from the lexical



Figure 1. D flip-flops and its semantics



Figure 2. A 4-bit serial-in serial-out shift register

scope, similar to capture in lambda calculus [3]:

$$fsm \{ s \Rightarrow t \} : O$$

We still miss the initial state, so we use the value v to denote the initial state of the FSM:

$$fsm \{ v \mid s \Rightarrow t \} : O$$

After all these steps, finally we arrived at a declarative representation of finite-state machines.

#### 3 Implicit State Machines, Informally

Suppose we are working in the domain of digital circuits. One of the most common state elements in digital circuits are *D flip-flops*, whose symbol and truth-table semantics are presented in Figure 1.

Intuitively, D flip-flops delay the input D by one clock. It can be seen from the truth table that the next state S' is always equal to the input D, and the output Q is always equal to the current state S.

Using implicit state machines, a one-bit D flip-flop with an input signal d can be represented as follows:

$$fsm \{ 0 \mid s \Rightarrow (d,s) \}$$

In the above, 0 represents the initial state of the D flip-flop; s represents the current state; d represents the input. The body is a pair (d, s), which means that the next state of the implicit state machine is the input d and the output is the current state s.

Note that in the above, the state variable s is bound, while the input d is not bound. This is a characteristic of implicit state machines, where the inputs are implicit, i.e., they are captured from the lexical environment, similar to capture in lambda calculus [3].

D flip-flops can be use to implement shift registers. In Figure 2, we implement a 4-bit serial-in serial-out shift register by chaining 4 D flip-flops. As a single D flip-flop delays

<sup>&</sup>lt;sup>1</sup>Technically, the quintuple described here is a Mealy machine, because it has an output. In embedded systems, pristine FSMs without output are not interesting.



Figure 3. A 4-bit serial-in parallel-out shift register

the input signal by one clock, intuitively the 4-bit serial-in serial-out shift register delays the input signal by 4 clocks.

We implement the 4-bit serial-in serial-out shift register for a given input d with implicit state machine as follows:

| let | q1 | = | fsm | { | 0 | Ι | s | => | (d,  | s) | } | in |
|-----|----|---|-----|---|---|---|---|----|------|----|---|----|
| let | q2 | = | fsm | { | 0 | Ι | s | => | (q1, | s) | } | in |
| let | q3 | = | fsm | { | 0 | Ι | s | => | (q2, | s) | } | in |
| let | q4 | = | fsm | { | 0 | Ι | s | => | (q3, | s) | } | in |
| q4  |    |   |     |   |   |   |   |    |      |    |   |    |

In the code above, we use the standard linguistic construct *let/in* to introduce local bindings.

Implicit state machines are just expressions, thus they may appear in any place where an expression is allowed. In particular, we may nest them to get another equivalent implementation of the 4-bit serial-in serial-out shift register:

```
fsm { 0 | s =>
    let q1 = fsm { 0 | s => (d, s) } in
    let q2 = fsm { 0 | s => (q1, s) } in
    let q3 = fsm { 0 | s => (q2, s) } in
    (q3, s)
}
```

An equivalent and simpler implementation of the 4-bit serial-in serial-out shift register is shown below:

fsm { (0, 0, 0, 0) | s => ((d, s.1, s.2, s.3), s.4) }

In the above, we use the syntax (t, ..., t) to represent a tuple, and *t.i* to represent the *i*-th component of the tuple *t*. In fact, we will show in the next section, there is a mechanic transformation from all other equivalent representation to this succinct form (Section 4.4).

There are also serial-in parallel-out shift registers, as shown in Figure 3. They can be implemented with implicit state machines as follows:

let q1 = fsm { 0 | s => (d, s) } in let q2 = fsm { 0 | s => (q1, s) } in let q3 = fsm { 0 | s => (q2, s) } in let q4 = fsm { 0 | s => (q3, s) } in (q1, q2, q3, q4)

An equivalent and simpler implementation of the 4-bit serial-in parallel-out shift register is shown below:

fsm { (0, 0, 0, 0) | s => ((d, s.1, s.2, s.3), s) }

We draw the readers' attention to the following properties of implicit state machines.

**Declarativeness**. In contrast to existing *imperative* programming models with finite-state machines [14, 27], implicit state machines are *declarative*. As we will see in the next section, the declarative nature of implicit state machines facilitate transformation of programs, thanks to *referential transparency*, which enables *substitute equals for equals* [33].

**Simplicity**. As we have seen in the example of D flip-flops, compared to the textbook presentation of D flip-flops in the form of truth tables, the representation based on implicit state machines is much simpler and more intuitive. While simple concepts can be explained in a complex way, we do not see how implicit state machines could be reduced to a simpler model.

**Flexibility**. Most existing programming models with finite state machines demand explicit enumeration of state transitions [12, 14, 21, 27]. In contrast, implicit state machines just do not mandate explicit enumeration of state transitions in the program. However, *they do not forbid that*. This means that programmers can continue to program with explicit states when necessary. This is can be done by introducing a *match*-expression:

```
fsm { 0 | s =>
  match (s, d) with
  case (0, 0) => (0, 0)
  case (0, 1) => (1, 0)
  case (1, 0) => (0, 1)
  case (1, 1) => (1, 1)
}
```

In the above, the match expression defines the semantics of D flip-flops in the form of truth tables (Figure 1).

**Recursive Composability**. Implicit state machines are recursively composable, which is a yardstick of proper language design. Recursive composability corresponds to the need for hierarchical decomposition in designing real-world systems.

**Universality**. The universality of implicit state machines are inherited from the universality of finite-state machines, as the latter can be represented by the former.

## 4 Implicit State Machines, Formally

In this section, we formalize implicit state machines in a small calculus with Boolean algebra as the domain intended for digital design.

#### 4.1 Syntax

The syntax of the calculus is presented below:

| t | ::= |                                    | terms                  |
|---|-----|------------------------------------|------------------------|
|   |     | a, b, c                            | external input         |
|   |     | x, y, z, s                         | variables              |
|   |     | let $x = t$ in $t$                 | let binding            |
|   |     | β                                  | Boolean value          |
|   |     | t * t                              | 1 bit and              |
|   |     | t + t                              | 1 bit or               |
|   |     | ! <i>t</i>                         | 1 bit not              |
|   |     | $(t,\ldots,t)$                     | tuple                  |
|   |     | t.i                                | projection             |
|   |     | $fsm \{ v \mid s \Rightarrow t \}$ | implicit state machine |
|   |     | • · · · ·                          | -                      |

| β | ::= | 0   1                       | Boolean values |
|---|-----|-----------------------------|----------------|
| v | ::= | $\beta \mid (v, \ldots, v)$ | values         |
| i | ::= | 0, 1, 2,                    | indexes        |

Beyond the basic elements of Boolean algebra, we also introduce *let*-bindings, which is a basic abstraction and reuse mechanism. Tuples and projections are introduced for parallel composition and decomposition. In a projection *t.i*, the index *i* must be a statically known number. For implicit state machines, we require that the initial state is a value.

A circuit usually has external inputs, which are represented by variables a, b, c. By convention, we use x, y, z for *let*-bindings, and s for the binding in implicit state machines.

We choose Boolean algebra as the domain theory, but it can also be other mathematical structures, for example natural numbers or tensors. Our transform does not assume properties of mathematical structures as long as we may *substitute equals for equals* [33].

To avoid technical details of same names in bindings, we assume the uniqueness of bound variables, which can be easily achieved via renaming.

#### 4.2 Semantics

The semantics follows the *synchronous hypothesis* [4], which assumes that the computation of the response to an input takes no time. For synchronous digital circuits, it means that the whole system produces an output at each clock tick.

The semantics of the language is defined with the help of a state  $\sigma$  and an environment  $\rho$ . The state  $\sigma$  maps a state variable to a state value, the environment variable  $\rho$  maps an external input to a value. The big-step operational semantics is defined with the following reduction relation:

$$t \xrightarrow{\sigma,\rho} v \mid \sigma'$$

It means that given the current state  $\sigma$  and environment  $\rho$ , the term *t* evaluates to the value *v* with the next state  $\sigma'$ . The reduction rules are defined in Figure 4. We explain the rules below:

• E-VALUE. If the term is already a value, do nothing. There are no nested state machines, thus the mapping for the next state is the empty set.

- E-INPUT. Look up the external variable *a* from the environment *ρ*.
- E-LET. First evaluate *t*<sub>1</sub> to the value *v*<sub>1</sub>, then evaluate *t*<sub>2</sub> with *x* replaced by *v*<sub>1</sub>.
- E-TUPLE. Evaluate each component in parallel to a value, and accumulate the mapping for the next state.
- E-PROJECT. First evaluate the term to a tuple value, then return the corresponding component.
- E-AND. Evaluate the two components in parallel to Boolean values, then call the helper method *and* to compute the resulting Boolean value β. As each component may contain implicit state machines, accumulate the mapping for the next state.
- E-OR. Similar as above, but use the helper function *or* to compute the resulting value.
- E-Not. Similar as above, but use the helper function *not* to compute the resulting value.
- E-FSM. First look up the value for the current state from the state map *σ*. Then evaluate the body of the state machine to a pair value (v<sub>1</sub>, v<sub>2</sub>). The output is v<sub>2</sub>, and the next state of the FSM is v<sub>1</sub>.

The reduction relation only defines one-tick semantics. The semantics of a system is defined by the *trace* of a given input series  $\rho_0, \rho_1, \cdots$ . We define it formally below:

**Definition 4.1** (Trace). The trace of a system *t* with respect to an input sequence  $\rho_0, \rho_1, \cdots$  is the sequence  $o_0, o_1, \cdots$  such that

- $t \xrightarrow{\sigma_0, \rho_0} o_0 \mid \sigma_1$ •  $\dots$ •  $t \xrightarrow{\sigma_i, \rho_i} o_i \mid \sigma_{i+1}$
- . . .

In the above,  $\sigma_0$  is the initial state of the implicit state machines as specified in *t*.

#### 4.3 Type System

To check well-formedness of programs, we introduce a simple type system to ensure that a well-typed program never gets stuck. The type system is presented in Figure 5.

In the system, there are two types: *Bool* for Boolean values and  $(T_1, \ldots, T_n)$  for tuples. We explain the typing rules below:

- T-BOOL. The type for Boolean values is always *Bool*.
- T-INPUT. For inputs, their types are predefined in the environment.
- T-VAR. For variables, their types also appear in the environment.

$$v \xrightarrow{\sigma, \rho} v \mid \emptyset$$
 (E-Value)  $a \xrightarrow{\sigma, \rho} \rho(a) \mid \emptyset$  (E-Input)

$$\frac{t_1 \xrightarrow{\sigma,\rho} v_1 \mid \sigma' \qquad [x \mapsto v_1]t_2 \xrightarrow{\sigma,\rho} v_2 \mid \sigma''}{let \ x = t_1 \ in \ t_2 \xrightarrow{\sigma,\rho} v \mid \sigma' \cup \sigma''}$$
(E-Let)

$$\frac{t_1 \xrightarrow{\sigma,\rho} v_1 \mid \sigma_1 \qquad \dots \qquad t_n \xrightarrow{\sigma,\rho} v_n \mid \sigma_n}{(t_1,\dots,t_n) \xrightarrow{\sigma,\rho} (v_1,\dots,v_n) \mid \sigma_1 \cup \dots \cup \sigma_n}$$
(E-TUPLE)

$$\frac{t \xrightarrow{\sigma,\rho} (v_1, \dots, v_i, \dots, v_n) \mid \sigma'}{t.i \xrightarrow{\sigma,\rho} v_i \mid \sigma'}$$
(E-Project)

$$\frac{t_1 \xrightarrow{\sigma,\rho} \beta_1 \mid \sigma' \quad t_2 \xrightarrow{\sigma,\rho} \beta_2 \mid \sigma'' \quad \beta = and(\beta_1,\beta_2)}{t_1 * t_2 \xrightarrow{\sigma,\rho} \beta \mid \sigma' \cup \sigma''}$$
(E-AND)

$$\frac{t_1 \xrightarrow{\sigma,\rho} \beta_1 \mid \sigma' \quad t_2 \xrightarrow{\sigma,\rho} \beta_2 \mid \sigma'' \quad \beta = or(\beta_1, \beta_2)}{t_1 + t_2 \xrightarrow{\sigma,\rho} \beta \mid \sigma' \cup \sigma''}$$
(E-OR)

$$\frac{t \xrightarrow{\sigma,\rho} \beta \mid \sigma' \qquad \beta' = not(\beta)}{!t \xrightarrow{\sigma,\rho} \beta' \mid \sigma'}$$
(E-Nor)

$$\frac{v = \sigma(s) \quad [s \mapsto v]t \mid \xrightarrow{\sigma, \rho} (v_1, v_2) \mid \sigma'}{fsm \{ v_0 \mid s \Rightarrow t \} \xrightarrow{\sigma, \rho} v_2 \mid \{ s \mapsto v_1 \} \cup \sigma'}$$
(E-FSM)

## Figure 4. Big-step operational semantics

$$T ::= Bool \mid (T, \ldots, T)$$

$$\begin{split} \Gamma \vdash \beta : Bool & (T-BOOL) & \frac{\Gamma \vdash t : (T_1, \dots, T_i, \dots, T_n)}{\Gamma \vdash t : : T_i} & (T-PROJECT) \\ \\ \frac{a: T \in \Gamma}{\Gamma \vdash a: T} & (T-INPUT) & \frac{\Gamma \vdash t_1 : Bool & \Gamma \vdash t_2 : Bool}{\Gamma \vdash t_1 * t_2 : Bool} & (T-AND) \\ \\ \frac{x: T \in \Gamma}{\Gamma \vdash x: T} & (T-VAR) & \frac{\Gamma \vdash t_1 : Bool & \Gamma \vdash t_2 : Bool}{\Gamma \vdash t_1 + t_2 : Bool} & (T-OR) \\ \\ \\ \frac{\Gamma \vdash t : Bool}{\Gamma \vdash t_1 : Bool} & (T-NOT) & \frac{\Gamma \vdash t_1 : T_1 & \Gamma, x: T_1 \vdash t_2 : T_2}{\Gamma \vdash let x = t_1 in t_2 : T_2} & (T-LET) \end{split}$$

$$\frac{\Gamma \vdash t_1 : T_1 \quad \dots \quad \Gamma \vdash t_n : T_n}{\Gamma \vdash (t_1, \dots, t_n) : (T_1, \dots, T_n)} \quad (T-TUPLE) \qquad \qquad \frac{\Gamma \vdash v : T_1 \quad \Gamma, s: T_1 \vdash t : (T_1, T_2)}{\Gamma \vdash fsm \{ v \mid s \Rightarrow t \} : T_2} \quad (T-FSM)$$

Figure 5. Type System

- T-Not. The term *t* must be of the type *Bool*.
- T-TUPLE. If each component has a type, and then the type of the tuple has a corresponding tuple type.
- T-PROJECT. If the term *t* has a tuple type, then the projection has the type of the corresponding component.
- T-AND. If each component has the type *Bool*, the result also has the type *Bool*.
- T-OR. The same as above.
- T-LET. If the bound term has the type of *T*<sub>1</sub>, and the body of the let-binding has the type *T*<sub>2</sub> under the environment Γ extended with the binding *x*:*T*<sub>1</sub>, then the let-binding has the type *T*<sub>2</sub>. Note that this rule forbids the usage of *x* in *t*<sub>1</sub>, which prevents undesired circles.
- T-FsM. If the initial value has the type  $T_1$ , and the body has the type  $(T_1, T_2)$  under the environment  $\Gamma$  extended with the binding *s*: $T_1$ , then the implicit state machine has the type  $T_2$ .

For the meta-theory of the type system, we need to define well-formedness of the input map and state map. We write  $\Gamma \vdash \xi$  to mean that the input map or state map  $\xi$  is well-typed under  $\Gamma$ , which is defined as follows:

$$\Gamma \vdash \emptyset \qquad \qquad \frac{\Gamma \vdash \xi \quad \emptyset \vdash v : T}{\Gamma. \alpha : T \vdash \xi \cup \{ \alpha \mapsto v \}}$$

In the above,  $\alpha$  ranges over input and state variables, and  $\xi$  ranges over input map and state map.

**Theorem 4.2** (Soundness). If  $\Gamma \vdash t : T$ , and if for each  $\rho_i$  in the input sequence  $\rho_0, \rho_1, \ldots$  we have  $\Gamma \vdash \rho_i$ , then there exists a trace for the system t corresponding to the input sequence.

The proof follows from the following lemma by induction on the length of the input sequence:

**Lemma 4.3.** If t is well-typed under the environment  $\Gamma$ , and the input map  $\rho$  is compatible with  $\Gamma$ , and the state map  $\sigma$  is type-compatible with the initial state map  $\sigma_0$  as specified in t, then t evaluates to a value v with updated state map  $\sigma'$ .

More formally, if  $\Gamma \vdash t : T$ , and  $\Gamma \vdash \rho$ , and there exists  $\Gamma'$ such that  $\Gamma' \vdash \sigma$  and  $\Gamma' \vdash \sigma_0$ , then there exists v and  $\sigma'$  such that  $t \xrightarrow{\sigma,\rho} v \mid \sigma', \Gamma \vdash v : T$  and  $\Gamma' \vdash \sigma'$ .

*Sketch.* By induction on the typing judgment  $\Gamma \vdash t : T$ .  $\Box$ 

#### 4.4 Flattening

In this section, we show that any system of implicit state machines is equivalent to a single flat implicit state machine. This can be achieved by a mechanic transformation. For the purpose of the transformation, we first define the *combinational fragment* of the language devoid of implicit state machines, which is represented by *e*:

$$e ::= \beta | e * e | e + e | !e | (e, ..., e) | e.i |$$
  
let x = e in e | x | s | a

The combinational fragment corresponds to combinational circuits, i.e., circuits without state elements, in contrast to sequential circuits.

The transformation consists of two major steps:

- Lifting: lifts FSMs to top-level (Figure 6).
- Merging: merges FSMs to a single FSM (Figure 7).

Lifting (Figure 6) results in *lifted normal form* (u) where all FSMs are nested at the top-level of the program, with a combinational fragment in the middle:

$$u ::= e \mid fsm \{ v \mid s \Rightarrow u \}$$

The relation  $t_1 \sim_L t_2$  says that the term  $t_1$  takes a lifting step to  $t_2$ . Lifting is defined with the help of the lifting context *L*. The lifting context specifies that the transformation follows the order left-right and top-down. The actual lifting happens with the function  $[\cdot]$ , which transforms the source program to the expected form. We explain the concrete transformation rules below:

- $fsm \{ v \mid s \Rightarrow e_1 \} * t_2$ . The FSM absorbs  $t_2$  into its body. The symmetric case, and the cases for AND and OR are similar.
- *let*  $x = fsm \{ v \mid s \Rightarrow e_1 \}$  *in*  $t_2$ . It pulls the letbinding into the body. The case in which FSM is in the body of let-binding is similar.
- $fsm \{ v \mid s \Rightarrow e \}$ .*i*. It pulls the projection into the body of FSM.
- $(\bar{e}, fsm \{ v \mid s \Rightarrow e \}, \bar{t})$ . It pulls the tuple into the body of FSM.

Note that to simplify the presentation, in the transformation rules we write let  $x, y = t_1$  in  $t_2$  as a syntactic sugar for let  $z = t_1$  in let x = z.1 in let y = z.2 in  $t_2$ .

Once all FSMs are nested at the top-level after lifting, merging (Figure 7) takes place. The relation  $u_1 \sim_M u_2$  says that the term  $u_1$  takes a merging step to  $u_2$ . Merging is defined with the help of the merging context M. The merging context specifies that the merging happens from inside towards outside. The actual merging step is quite straightforward: it just combines the initial states  $v_1$  and  $v_2$ , as well as merges  $s_1$  and  $s_2$  into s.

#### 4.5 Discussion

Flattening makes it immediately obvious that a digital circuit with state elements (such as registers and flip-flops) are equivalent to a combinational circuit with all state elements at the boundary.

Figure 6. Lifting of nested FSMs.

 $M ::= [\cdot] \mid fsm \{ v \mid s \Rightarrow M \}$  $\underbrace{\llbracket u \rrbracket = fsm \{ v \mid s \Rightarrow e \}}_{M[u] \rightsquigarrow_M M[fsm \{ v \mid s \Rightarrow e \}]}$ 

 $[ fsm \{ v_1 \mid s_1 \Rightarrow fsm \{ v_2 \mid s_2 \Rightarrow e_2 \} \} ] = fsm \{ (v_1, v_2) \mid s \Rightarrow let s_1, s_2 = s in let x = e_2 in ((x.2.1, x.1), x.2.2) \}$ 

Figure 7. Merging of nested FSMs.

We believe the insight itself is not new, however, implicit state machines make it obvious. In contrast, it is obscured in the network-based model of digital circuits, e.g., it is not obvious how to push a D flip-flop in the middle of a circuit network to its boundary.

The declarative nature of implicit state machines enables the reasoning principle of *substituting equals for equals* [33]. It facilitates many common program optimizations, such as dead-code elimination, common-subexpression elimination, constant folding, etc.

[35] hold the view that it is a golden age for applying programming language techniques for improving hardware design. We believe implicit state machines may contribute to that initiative.

## 5 Implicit State Machines in Scala

To assess the feasibility of implicit state machines as a programming construct, we implemented an embedded DSL in Scala for digital design. We experimented usability of the embedded DSL by creating circuits of varying complexity, from half adders to a micro-controller.

#### 5.1 Embedded DSL

For readers not familiar with DSLs, there are generally two approaches to implement a DSL:

- External DSL, in which the DSL is implemented with a standalone compiler (Figure 8)
- Embedded DSL, in which the DSL is defined as a library within a host language (Figure 9)

In the external approach, the language designer defines syntax of the DSL, users write DSL programs and then feed the source code into the DSL compiler. For practicality, there is the need to provide IDE support for the DSL to improve programming experience.

In the embedded approach, the language designer only needs to define the *abstract syntax tree* (AST) data format and provide core compiler phases as a library in an implementation language, e.g., Scala. Users write DSL programs in Scala to directly construct the ASTs, and then feed them into the compilation pipeline. As programmers write code in an existing language, e.g., Scala, there is no need to provide additional IDE support.

Given that the embedded approach avoids the overhead of defining concrete syntax of the DSL and providing IDE suport, we follow the approach in our work.

Our DSL is based on implicit state machines extended with pairs and bit vectors. Implicit state machines are the only state elements in the DSL. An excerpt of the abstract syntax tree definitions is presented in Figure 10.



Figure 9. Architecture of Embedded DSLs

```
sealed abstract class Sig[T <: Type] // base class of AST
case class Fsm[S <: Type, T <: Type](sym: Symbol, init: Value, body: Sig[S ~ T]) extends Sig[T]
case class Let[S <: Type, T <: Type](sym: Symbol, sig: Sig[S], body: Sig[T]) extends Sig[T]
case class Var[T <: Type](sym: Symbol, tpe: Type) extends Sig[T] // variable for inputs and bindings
case class And[T <: Num](lhs: Sig[Vec[T]], rhs: Sig[Vec[T]]) extends Sig[Vec[T]]
case class Mux[T <: Type](cond: Sig[Bit], thenp: Sig[T], elsep: Sig[T]) extends Sig[T]
sealed abstract class Type // base class of types
case class PairT[S <: Type, T <: Type](lhs: S, rhs: T) extends Type
case class VecT[T <: Num](width: T) extends Type</pre>
```

Figure 10. An excerpt of abstract syntax trees of the DSL

The class Sig is the base class of abstract syntax trees, and the class Type is the base class of the types of signals. The DSL also defines the following aliases for types:

```
type ~[S <: Type, T <: Type] = PairT[S, T]</pre>
```

```
2 type Vec[T <: Num] = VecT[T]</pre>
```

```
s type Bit = VecT[1]
```

```
4 type Num = Int
```

The type Sig[Bit] denotes signals of 1-bit vector, which is an alias of Sig[Vec[1]]. The type Sig[Vec[2]] denotes signals of 2-bit vector. Here we take advantage of *literal types* in Scala [30], which supports the usage of literal values as types.

The DSL supports common bit-wise operations such as XOR, AND, OR, ADD, SUB, SHIFT and MUX. All these operations are supported in Verilog [23], and they follow the same semantics as in Verilog.

The design intentionall makes the class Sig take an additional type parameter, which signifies the type of the signal. This way, we can profit the Scala type system to automatically check signal mismatch errors, e.g., perform OR operation on a 4-bit and an 8-bit signal. The additional type parameter does not play any role at run-time.

#### 5.2 A Quick Glance

The following code shows how we may implement a half adder in our DSL:

def halfAdder(a: Sig[Bit], b: Sig[Bit]) =

```
val s = a ^ b
val c = a & b
```

c ++ s

The operator ++ concatenates two bit vectors to form a bigger bit vector - Sig[Vec[2]] in the example above.

We may compose two half adders to create a full adder, which takes a carry cin as input:

```
1 def full(a: Sig[Bit], b: Sig[Bit], cin: Sig[Bit]) =
2 val ab = halfAdder(a, b)
3 val s = halfAdder(ab(0), cin)
4 val cout = ab(1) | s(1)
5 cout ++ s(0)
```

In the above, we make two calls to halfAdder. Each call will create a copy of the half adder circuit to be composed in the fuller adder. It returns the carry and the sum. We may compose them further to create a 2-bit adder:

```
1 def adder2(a: Sig[Vec[2]], b: Sig[Vec[2]]) =
2 val cs0 = full(a(0), b(0), 0)
3 val cs1 = full(a(1), b(1), cs0(1))
```

```
4 cs1(1) + cs1(0) + cs0(0)
```

Implicit State Machines

To actually generate a representation of the circuit, we need to specify the input signals:

val a = variable[Vec[2]]("a")
val b = variable[Vec[2]]("b")
val circuit = adder2(a, b)

Then we can create a simulator of the circuit:

```
val add2 = circuit.eval(a, b)
```

Finally, we can test the simulator:

```
add2(List(Value(1, 0), Value(0, 1))) match
```

case Value(0, 1, 1) => println("success")

#### 5.3 Sequential Circuits

We show how to create sequential circuits with the example of moving average filter. The moving average filter we are going to implement is specified below:

$$Y_i = (X_i + 2 * X_{i-1} + X_{i-2})/4$$

For the input  $X_i$ , the output  $Y_i$  also depends on the previous values  $X_{i-1}$  and  $X_{i-2}$ . We can define an operator delay based on implicit state machines:

```
1 def delay[T <: Type](sig: Sig[T], init: Value) =
2 fsm("delay", init) { (last: Sig[T]) =>
3 sig ~ last
4 }
```

In the code above, we declare an implicit state machine with the specified initial state init. The body of the FSM is a pair sig ~ last, where the first part becomes the next state, and the second part becomes the output.

Now we may create the circuit for the moving average:

```
1 def movingAverage(in: Sig[Vec[8]]) =
2 val z1 = delay(in, 0.toValue(8))
3 val z2 = delay(z1, 0.toValue(8))
```

```
4 (in + (z1 << 1) + z2) >> 2
```

In the code above, we first create an instance of the delay circuit and bind it to the variable z1. Then we delay the signal z1 to get z2. Finally, the equation is encoded straightforwardly.

Note that in the above, the end user is programming in dataflow style à la Lustre [11]. There is no need for the programmer to think in terms of state machines in such use cases. We discuss this in a broader context in Section 5.6.

## 5.4 Verilog Generation

We can generate Verilog code for the moving average filter as follows:

```
val a = variable[Vec[8]]("a")
```

```
val circuit = movingAverage(a)
```

```
s circuit.toVerilog("Filter", a)
```

The generated Verilog code is presented in Figure 11. In the Verilog code, lines 9-14 deal with sequential logic, the other code deal with combinational logic.

```
module Filter (CLK, a, out);
1
     input CLK;
     input [7:0] a;
3
     output [7:0] out;
4
     reg [15:0] s;
5
6
     assign out = (s[7:0] + (s[15:8] << 1'b1) + a)
          >> 2'b10;
8
     initial begin
9
       s = 16'b00000000000000;
10
     end
     always @ (posedge CLK)
13
       s <= { a, s[15:8] };
14
   endmodule
15
```

**Figure 11.** Generated Verilog code for the moving average filter (redundant parenthesese at line 8 manually removed for the sake of readability)



Figure 12. Architecture of the micro-controller

This is the typical code generated by our DSL compiler, as it performs flattening of the circuit (Section 4.4), which results in a single finite-state machine with a combinational core.

The Verilog code generation takes a flattened AST representation of the circuit as input. It goes through the AST representation and translate the DSL primitive with corresponding Verilog code. As the translation process is standard, we omit the details here.

## 5.5 Case Study: Micro-controller

We implemented an accumulator-based micro-controller in the DSL inspired by Leros [31].

The architecture of the micro-controller is shown in Figure 12. At the high-level, the micro-controller contains three architectural states: the program counter (PC), the accumulator register (ACC) and the pending status (MODE). The micro-controller interfaces with a memory bus, which contains a simple protocol consisting of read address, control (read / write) and data. The micro-controller contains an on-chip read-only instruction memory, which is different from the external data memory interfaced by the bus.

The micro-controller is implemented with an implicit state machine:

```
1 fsm("processor", pc0 ~ acc0 ~ mode0) { state =>
2 val pc ~ acc ~ mode = state
3 // ...
4 }
```

The variable pc refers to the program counter, acc is the accumulator register, mode indicates whether the controller is waiting for data from the external memory.

The skeleton of the implementation is as follows:

```
let("pcNext", pc + 1) { pcNext =>
     let("instr", readInstr(pc)) { instr =>
         /* ... */
3
         when (opcode === ADDI.toSig(8)) {
4
           val acc2 = acc + operand
5
           next(acc = acc2)
         }
7
         /* ... */
      }
9
   }
10
```

It first increments the program counter pc and bind the result to pcNext. Then it binds the current instruction to instr. At the circuit-level, the operations are executed in parallel. Finally, the instruction is decoded and executed in a series of when constructs, depending on the mode of the micro-controller.

The when construct is a syntactic sugar created from multiplexers that supports selecting one of two n-bit inputs based a 1-bit control signal.

Eventually, each branch calls the local method next with appropriate arguments:

```
1 def next(pc: Sig[PC] = pcNext,
2 acc: Sig[ACC] = acc,
3 mode: Sig[Bit] = 0,
4 out: Sig[BusOut] = 0)
5 = (pc ~ acc ~ mode) ~ out
```

As can be seen from above, the method next defines default values for all arguments, such that each branch may only specify parameters that are different. For example, the following code handles the unconditional jump instruction BR:

```
when (opcode === BR.toSig(8)) {
    next(pc = pc + jmpOffset)
  }
```

The indirect ADD instruction needs to load data from external memory, thus putting the controller in the *pending* mode, as the following code shows:

```
when (opcode === ADD.toSig(8)) {
    next(pc = pc, mode = 1, out = readReq(instr))
  }
```

Fengyun Liu and Aleksandar Prokopec

The logic for the pending mode is as follows:

```
when (mode) {
    /* pending mode */
    when (opcode === ADD.toSig(8)) {
    next(acc = acc + busIn)
    }
    /* ... */
  } otherwise {
    /* normal mode */
    }
```

The code above depends on the protocol which requires that the I/O devices make the requested data available on the bus in the cycle following the request.

The programming experience is largely positive, thanks to the declarative nature of the DSL. Compared to VHDL or Verilog, there are no "wires" to connect in the DSL and there are no combinational cyles by construction.

We test the implementation with small assembly programs and verify the result with a circuit simulator in Scala. We are aware that the micro-controller is still too simple and it may not match quality standards. For example, we do not implement pipelining [25] nor do we separate out a reusable arithmetic-logic unit (ALU) for the two execution modes.

However, we conjecture that implicit state machines make it possible to automate some of such optimizations using compilation techniques. We leave it for future work to capitalize on such insights to implement RISC-V cores and compare with the state-of-the-art open source implementations.

#### 5.6 Discussion: State Machine VS. Dataflow

It has long been observed that embedded systems fall into two categories: (1) control-dominated applications and (2) data-oriented applications [14]. For control-dominated applications, programming based on finite-state machines is a good fit. For data-oriented applications, declarative dataflow programming is a good fit. However, real systems are usually a mix of both styles, which motivates the extension of the declarative dataflow language Lustre [11] with state machines [12, 14]. The extension is in imperative style with explicit state representation, and it desugars to a core dataflow calculus.

Our work can be seen as taken an opposite approach to [14]: Instead of desugaring finite-state machines into a core dataflow calculus, we make implicit state machines as the fundamental building block, and desugar dataflow programming constructs to implicit state machines (Section 5.3). Given that the dataflow calculus of Lustre eventually compiles to finite-state machines for execution, we believe the introduction of implicit state machines as a primitive will be an interesting addition to the programming methodologies of real-time and embedded systems.

#### 5.7 Limitations

There are several limitations of the current DSL:

- It does not support multi-clock design.
- It only supports binary state, no analog nor tri-state.
- It does not perform logic optimizations on the circuit.

While the DSL is useful to assess the practicality of implicit state machines as a programing model, it is not a productionready artifact. Meanwhile, none of the limitations above is an inherent drawback of implicit state machines as an abstraction.

## 6 Related Work

We have discussed related work in Section 4.5 and Section 5.6. Here we would like to acknowledge more work that inspired our research.

Our work is influenced by the french synchronous languages, Esterel [7], Signal [6] and Lustre [11]. In particular, the semantics of implicit state machines follow the *synchrony hypothesis* [5]. There are ongoing efforts in formalizing and mechanizing the semantics of these languages [10, 16] as well as verifying programs in these languages [34], which could be a direction for our future work.

There exists plenty of intermediate representations (IR) for hardware design, such as Calyx [28], FIRRTL [24], LLHD [32]. We believe implicit state machines will be a useful abstraction in the design of IRs due to its declarativeness, simplicity and universality.

Implicit state machines bear some similarity to state monads in functional programming [36]. From the programmer's perspective, there are three main differences: (1) state monads require programmers to thread-through the state explicitly in the program, while there is no such requirement for implicit state machines; (2) state monads do not allow programmers to specify the initial state in a decentralized way; (3) composing two state monads incurs overhead, while compositionability is a feature of implicit state machines. From the perspective of compiler writers, state monads are just design patterns in functional programming, they are categorically different from IRs that compiler phases can work on.

There are many DSLs for digital design. The Lava family DSLs [8, 17] use *delay* as a primitive to represent state, which can be thought as a restricted version of Lustre [11] embedded in Haskell, i.e., they are in a dataflow style as Lustre. Chisel [2] uses registers as a state primitive and follows an imperative programming style. Bluespec [29], Kami [13] and Koika [9] are based on guarded atomic actions. Our DSL is different in the sense that it is based on the novel state primitive — implicit state machines.

Graphical representation of programs seems to be favored over text-based programs in some application domains. There are several visual languages for programming with finitestate machines, such as Statecharts [21], SyncCharts [1], Simulink/Stateflow [19]. We are investigating how to combine the benefits of visualizal languages and text-based languages in programming embedded systems.

## 7 Conclusion

In this paper, we showed that by sticking to the design principle of declarativeness, we arrive at a novel abstraction: *implicit state machines*. Implicit state machines are recursively composable and universal, which makes them promising both as a programming model as well as intermediate representation.

We formalized the concept of implicit state machines in a calculus with Boolean algebra as the domain and showed that it serves as an elegant model of digital circuits.

We implemented an embedded DSL in Scala based on implicit state machines, which supports both dataflow style programming and state-machine style programming. We implemented a micro-controller in the DSL and the experience of programming with implicit state machines is positive.

**Future Work**. We are considering designing a standalone domain-specific language based on implicit state machines for the application domains of Internet of Things (IoT) and industrial automation.

## Acknowledgments

Fengyun Liu thanks the Programming Methods Laboratory (LAMP) at École polytechnique fédérale de Lausanne (EPFL) for hosting the research as part of his PhD studies. We also thank Prof. Paolo Ienne, Prof. Viktor Kunčak, Prof. Martin Odersky and Dr. Aggelos Biboudis for helpful discussions.

## References

- [1] Charles André and Marie-Agnès Peraldi-Frati. 2000. Behavioral Specification of a Circuit Using SyncCharts: A Case Study. In 26th EUROMI-CRO 2000 Conference, Informatics: Inventing the Future, 5-7 September 2000, Maastricht, The Netherlands. IEEE Computer Society, 1091. https://doi.org/10.1109/EURMIC.2000.874620
- [2] Jonathan Bachrach, Huy Vo, Brian C. Richards, Yunsup Lee, Andrew Waterman, Rimas Avizienis, John Wawrzynek, and Krste Asanovic. 2012. Chisel: Constructing hardware in a Scala embedded language. DAC Design Automation Conference 2012 (2012), 1212–1221.
- [3] Hendrik Pieter Barendregt. 1985. The lambda calculus its syntax and semantics. Studies in logic and the foundations of mathematics, Vol. 103. North-Holland.
- [4] A. Benveniste and G. Berry. 1991. The synchronous approach to reactive and real-time systems. *Proc. IEEE* 79, 9 (Sept. 1991). https: //doi.org/10.1109/5.97297
- [5] Albert Benveniste, Paul Caspi, Stephen A. Edwards, Nicolas Halbwachs, Paul Le Guernic, and Robert de Simone. 2003. The synchronous languages 12 years later. *Proc. IEEE* 91, 1 (2003), 64–83. https://doi.org/10.1109/JPROC.2002.805826
- [6] Albert Benveniste, Paul Le Guernic, and Christian Jacquemot. 1991. Synchronous programming with events and relations: the SIGNAL language and its semantics. *Science of Computer Programming* 16, 2 (Sept. 1991). https://doi.org/10.1016/0167-6423(91)90001-E
- [7] Gérard Berry and Georges Gonthier. 1992. The Esterel Synchronous Programming Language: Design, Semantics, Implementation. Sci.

*Comput. Program.* 19, 2 (1992), 87–152. https://doi.org/10.1016/0167-6423(92)90005-V

- [8] Per Bjesse, Koen Claessen, Mary Sheeran, and Satnam Singh. 1998. Lava: Hardware Design in Haskell. In Proceedings of the third ACM SIG-PLAN International Conference on Functional Programming (ICFP '98), Baltimore, Maryland, USA, September 27-29, 1998, Matthias Felleisen, Paul Hudak, and Christian Queinnec (Eds.). ACM, 174–184. https: //doi.org/10.1145/289423.289440
- [9] Thomas Bourgeat, Clément Pit-Claudel, Adam Chlipala, and Arvind. 2020. The essence of Bluespec: a core language for rule-based hardware design. In Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, PLDI 2020, London, UK, June 15-20, 2020, Alastair F. Donaldson and Emina Torlak (Eds.). ACM, 243–257. https://doi.org/10.1145/3385412.3385965
- [10] Timothy Bourke, Lélio Brun, Pierre-Évariste Dagand, Xavier Leroy, Marc Pouzet, and Lionel Rieg. 2017. A Formally Verified Compiler for Lustre. (2017).
- [11] P. Caspi, D. Pilaud, N. Halbwachs, and J. A. Plaice. 1987. LUSTRE: A Declarative Language for Real-time Programming. In Proceedings of the 14th ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages (POPL '87). ACM, New York, NY, USA. https://doi.org/10. 1145/41625.41641 event-place: Munich, West Germany.
- [12] Paul Caspi and Marc Pouzet. 2008. Synchronous Functional Programming : The Lucid Synchrone Experiment.
- [13] Joonwon Choi, Muralidaran Vijayaraghavan, Benjamin Sherman, Adam Chlipala, and Arvind. 2017. Kami: a platform for high-level parametric hardware specification and its modular verification. *Proc. ACM Program. Lang.* 1, ICFP (2017), 24:1–24:30. https://doi.org/10. 1145/3110268
- [14] Jean-Louis Colaço, Bruno Pagano, and Marc Pouzet. 2005. A conservative extension of synchronous data-flow with state machines. In Proceedings of the 5th ACM international conference on Embedded software - EMSOFT '05. ACM Press, Jersey City, NJ, USA. https: //doi.org/10.1145/1086228.1086261
- [15] Mary F. Fernández, Daniela Florescu, Alon Y. Halevy, and Dan Suciu. 2000. Declarative specification of Web sites with Strudel. *The VLDB Journal* 9 (2000), 38–55.
- [16] Spencer P. Florence, Shu-Hung You, Jesse A. Tov, and Robert Bruce Findler. 2019. A calculus for Esterel: if can, can. if no can, no can. *Proceedings of the ACM on Programming Languages* 3, POPL (Jan. 2019). https://doi.org/10.1145/3290374
- [17] Andy Gill, Tristan Bull, Garrin Kimmell, Erik Perrins, Ed Komp, and Brett Werling. 2009. Introducing Kansas Lava. In *Implementation and Application of Functional Languages - 21st International Symposium*, *IFL 2009, South Orange, NJ, USA, September 23-25, 2009, Revised Selected Papers (Lecture Notes in Computer Science, Vol. 6041)*, Marco T. Morazán and Sven-Bodo Scholz (Eds.). Springer, 18–35. https://doi.org/10.1007/ 978-3-642-16478-1\_2
- [18] Nicolas Halbwachs, Daniel Pilaud, Farid Ouabdesselam, and Anne-Cecile Glory. 1989. Specifying, Programming and Verifying Real-Time Systems Using a Synchronous Declarative Language. In Automatic Verification Methods for Finite State Systems, International Workshop, Grenoble, France, June 12-14, 1989, Proceedings (Lecture Notes in Computer Science, Vol. 407), Joseph Sifakis (Ed.). Springer, 213–231. https://doi.org/10.1007/3-540-52148-8\_18
- [19] Grégoire Hamon and John M. Rushby. 2007. An operational semantics for Stateflow. Int. J. Softw. Tools Technol. Transf. 9, 5-6 (2007), 447–456. https://doi.org/10.1007/s10009-007-0049-7
- [20] Michael Hanus and Christof Kluß. 2009. Declarative Programming of User Interfaces. In PADL.
- [21] David Harel. 1987. Statecharts: a visual formalism for complex systems. Science of Computer Programming 8, 3 (June 1987). https://doi.org/10. 1016/0167-6423(87)90035-9
- [22] Timothy L. Hinrichs. 2011. Plato: A Compiler for Interactive Web Forms. In *PADL*.

- [23] IEEE. 2005. IEEE Standard for Verilog Hardware Description Language. IEEE.
- [24] Adam M. Izraelevitz, Jack Koenig, Patrick Li, Richard Lin, Angie Wang, Albert Magyar, Donggyu Kim, Colin Schmidt, Chick Markley, Jim Lawson, and Jonathan Bachrach. 2017. Reusability is FIRRTL ground: Hardware construction languages, compiler frameworks, and transformations. In 2017 IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2017, Irvine, CA, USA, November 13-16, 2017, Sri Parameswaran (Ed.). IEEE, 209–216. https://doi.org/10.1109/ICCAD. 2017.8203780
- [25] Daniel Kroening and Wolfgang J. Paul. 2001. Automated Pipeline Design. In Proceedings of the 38th Design Automation Conference, DAC 2001, Las Vegas, NV, USA, June 18-22, 2001. ACM, 810–815. https: //doi.org/10.1145/378239.379071
- [26] Michael Leuschel. 2008. Declarative programming for verification: lessons and outlook. In PPDP '08.
- [27] Xun Li, Mohit Tiwari, Jason Oberg, Vineeth Kashyap, Frederic T. Chong, Timothy Sherwood, and Ben Hardekopf. 2011. Caisson: a hardware description language for secure information flow. In Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2011, San Jose, CA, USA, June 4-8, 2011, Mary W. Hall and David A. Padua (Eds.). ACM, 109–120. https://doi.org/10.1145/1993498.1993512
- [28] Rachit Nigam, Samuel Thomas, Zhijing Li, and Adrian Sampson. 2021. A compiler infrastructure for accelerator generators. In ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Virtual Event, USA, April 19-23, 2021, Tim Sherwood, Emery D. Berger, and Christos Kozyrakis (Eds.). ACM, 804–817. https://doi.org/10.1145/3445814.3446712
- [29] Rishiyur S. Nikhil. 2004. Bluespec System Verilog: efficient, correct RTL from high level specifications. In 2nd ACM & IEEE International Conference on Formal Methods and Models for Co-Design (MEMOCODE 2004), 23-25 June 2004, San Diego, California, USA, Proceedings. IEEE Computer Society, 69–70. https://doi.org/10.1109/MEMCOD.2004. 1459818
- [30] Martin Odersky. 2019. Scala Language Specification. https://scalalang.org/files/archive/spec/2.13/.
- [31] Martin Schoeberl. 2011. Leros: A Tiny Microcontroller for FPGAs. In International Conference on Field Programmable Logic and Applications, FPL 2011, September 5-7, Chania, Crete, Greece. IEEE Computer Society, 10–14. https://doi.org/10.1109/FPL.2011.13
- [32] Fabian Schuiki, Andreas Kurth, Tobias Grosser, and Luca Benini. 2020. LLHD: a multi-level intermediate representation for hardware description languages. In Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, PLDI 2020, London, UK, June 15-20, 2020, Alastair F. Donaldson and Emina Torlak (Eds.). ACM, 258–271. https://doi.org/10.1145/3385412.3386024
- [33] Harald Søndergaard and Peter Sestoft. 1990. Referential transparency, definiteness and unfoldability. Acta Informatica 27 (1990), 505–517.
- [34] Yahui Song and Wei-Ngan Chin. 2021. A Synchronous Effects Logic for Temporal Verification of Pure Esterel. In Verification, Model Checking, and Abstract Interpretation - 22nd International Conference, VMCAI 2021, Copenhagen, Denmark, January 17-19, 2021, Proceedings (Lecture Notes in Computer Science, Vol. 12597), Fritz Henglein, Sharon Shoham, and Yakir Vizel (Eds.). Springer, 417–440. https://doi.org/10.1007/978-3-030-67067-2\_19
- [35] Lenny Truong and Pat Hanrahan. 2019. A Golden Age of Hardware Description Languages: Applying Programming Language Techniques to Improve Design Productivity. In 3rd Summit on Advances in Programming Languages, SNAPL 2019, May 16-17, 2019, Providence, RI, USA (LIPIcs, Vol. 136), Benjamin S. Lerner, Rastislav Bodík, and Shriram Krishnamurthi (Eds.). Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 7:1–7:21. https://doi.org/10.4230/LIPIcs.SNAPL.2019.7

Implicit State Machines

[36] Philip Wadler. 1992. Monads for functional programming. In Program Design Calculi, Proceedings of the NATO Advanced Study Institute on Program Design Calculi, Marktoberdorf, Germany, July 28 - August 9, *1992 (NATO ASI Series, Vol. 118)*, Manfred Broy (Ed.). Springer, 233–264. https://doi.org/10.1007/978-3-662-02880-3\_8