

# Building the Infinite Brain

**COMP 590/790** 

Raghavendra Pradyumna Pothukuchi



## **Quick Review**

#### What are BCIs?

Broadly, systems that read, process, and modulate neural activity in the brain

#### Types of BCIs

Many, both invasive/implanted and wearable

#### What are BCIs used for?

Understanding brain function, treat dysfunction, restore lost function, and augment cognition

Intimately tied to an individual's identity

### What are the computational challenges in designing BCIs?

Growing sensor data needs to be processed on the device Computers today are limited in achieving both efficiency and flexibility, under safety constraints



## General Suggestions

#### Take notes

No need to repeat what's in the slides, but note connections, insights

### Write down ideas and follow-ups

Even if you don't plan to revisit them (for now)

### When reading, go down (good) rabbit holes

Trails of papers—builds background knowledge, but don't lose sight of the overall goal: Tree traversal!

#### Learn to refine your thoughts into formal and precise summaries, ideas

Even if you don't plan to revisit them (for now)

"Writing is nature's way of letting you know how sloppy your thinking is. Math is nature's way of letting you know how sloppy your writing is." Leslie Lamport



## More on The Project

### Working project

#### Examples

- An FPGA/microcontroller implementation of FFT and analysis of power and performance
- Software to improve/maintain a BCI tool or other software
  - BRAND: <a href="https://github.com/brandbci/brand">https://github.com/brandbci/brand</a>
  - Foresee: <a href="https://github.com/bci-foresee/foresee">https://github.com/bci-foresee/foresee</a>
- Verification (or Correctness) specification and analysis of a BCI hardware or software kernel
- An EEG BCI game of Pong
- An analysis of algorithms used for seizure detection and their suitability for on-device execution
- A compiler to translate a BCI application to fit microcontrollers
- Python to Verilog translation for DWT in BCI processor design
- Extension of a BCI processing architecture
- Evaluation of a machine learning algorithm for speech decoding and suggested re-design for on-device execution
- Leveraging neuroscientific markers for efficient movement intent decoding on device
- An analysis of cursor movement methods identifying key computational techniques and bottlenecks

••

You should argue for why your design (hardware, software) is good

You can search online for inspiration, but do not use someone else's code or work—recall the honor code

# Writing the Project Proposal



The Heilmeier Catechism <a href="https://www.darpa.mil/about/heilmeier-catechism">https://www.darpa.mil/about/heilmeier-catechism</a>

George H. Heilmeier, former DARPA director

2-page proposals due 9/3

Goal: What are you trying to do?

State of the art and challenges/limitations: How is it done today, and what are the limits of current practice?

Novelty and realizability: What is new in your approach, and why do you think it will be successful?

Significance and impact: Who cares? If you are successful, what difference will it make?

Executional challenges: What are the risks?

**Effort:** How much will it cost?

Timeline: How long will it take?



Metrics and deliverables: What are the midterm and final "exams" to check for success?

## Writing a Paper Review



The Task of the Referee <a href="http://www2.eecs.berkeley.edu/Pubs/TechRpts/1989/CSD-89-511.pdf">http://www2.eecs.berkeley.edu/Pubs/TechRpts/1989/CSD-89-511.pdf</a>

Alan Smith, Emeritus, UC Berkeley

Summarize the paper

What did you learn?

What are the novel contributions of the paper?

Positives, strengths etc.

Negatives, limitations

Summary and review of assumptions, methods, results (check correctness too!)

Significance and Impact (past, present or future)

When we shift to paper presentations:

1-2 questions for discussion!

Connect the paper to the themes of the course and other papers



# For Today

- Quick review
- Introductions
- What is computer architecture?
- How are computer architectures analyzed and compared?



### Introductions!

Name

Grad/undergrad

Focus area

Why are you taking the course?



# **Example Application**

### **Restoring function**



Computational cognitive frameworks



### Mitigating seizures





## BCI Computing is a Full Stack Problem





## What is Computer Architecture?

#### Previously, it meant the ISA

Most are register based (operands and results in registers), other variants include stack, accumulator

### Now, it could mean everything between the OS and circuit design!

Distinctions are made based on the context

### What we discuss is also called computer organization



Gene Amdahl



Fred Brooks Jr

"The term architecture is used here to describe the attributes of a system as seen by the programmer, i.e., the conceptual structure and functional behavior, as distinct from the organization of the data flow and controls, the logical design, and the physical implementation."

Architecture of the IBM System/360, Apr 1964





## Goals of A Computer Architect



Reading task: "Hints and Principles for Computer System Design (2021)" https://arxiv.org/abs/2011.02455

#### **STEADY** by **AID** with **ART**

Butler Lampson, Pioneer in computer system design

Applicable for BCI computers too!

Simple Adaptable Dependable **Timely** Efficient Yummy Goals

Will it sell? Is it fast? Can it evolve? Does it work? Is it clean? Is it ready?

Incremental

**Approximate** Speculate,

Independent, iterate... compress...

Divide and conquer

Abstractions, concurrent...

Techniques

Process Techniques Architecture Automate Review Test

What? STEADY — Simple, Timely, Efficient, Adaptable, Dependable, Yummy Goals

**Techniques** by AID — Approximate, Incremental, Divide & Conquer How?

with ART —Architecture, Automate, Review, Techniques, Test When, who? **Process** 



## Measuring Goals

Simple Timely Efficient Adaptable Dependable Yummy

Time-to-market

Latency

Throughput

Energy

Throughput per watt

Energy per bit

Power

Energy×Delay (ED or EDP)

Quality of service

Tail latency

Uptime

Downtime

Mean time to failure

Mean time between failures



## Measuring Execution Time

Execution time = 
$$\frac{\text{instructions}}{\text{program}} \times \frac{\text{cycles}}{\text{instruction}} \times \frac{\text{time}}{\text{cycle}}$$

$$\frac{Path \ length}{CPI\left(=\frac{1}{IPC}\right)} \ period\left(=\frac{1}{f}\right)$$



# Measuring Power

 $\mathbf{Power} = \mathbf{Power}_{static} + \mathbf{Power}_{dynamic}$ 

 $Power_{dynamic} \propto Activity \times Capacitance \times Voltage^2 \times frequency$ 

**Power**<sub>static</sub>  $\propto$  Voltage  $\times$  Current



## Improving Efficiency

First get it right, and then, get it fast

Efficiency of resources—what are they?

Computing, storage, communication, power, energy

### Concept of a bottleneck

The part of the system that consumes the most resources

E.g., single server under contention, sequential execution, administration



### Amdahl's Law



"Validity of the Single Processor Approach to Achieving Large Scale Computing Capabilities", AFIPS 1967

$$T_{\text{orig}} = T_{\text{seq}} + T_{\text{par}}$$

$$s = \frac{T_{seq}}{T_{orig}}, f = \frac{T_{par}}{T_{orig}}, p \text{ is speedup of } T_{par}$$

$$T_{new} = T_{seq} + \frac{T_{par}}{p} = T_{orig} - T_{par} + \frac{T_{par}}{p}$$

There may be more!

Overall speedup = 
$$\frac{T_{\text{orig}}}{T_{\text{new}}} = \frac{T_{\text{orig}}}{T_{orig} - T_{par} + \frac{T_{par}}{p}} = \frac{1}{(1-f) + \frac{f}{p}} = \frac{1}{s + \frac{f}{p}}$$



Bottleneck!

### Amdahl's Law Class Exercise





### How to Fix a Bottleneck?

#### First find it!

#### Change the algorithm

The previous bottleneck no longer applies—e.g., concurrency

### Find a fast path

Make the bottleneck rare—normal case is fast, and some progress is made for the worst case

Slow path time,  $T_s$  and Fast path time,  $T_f$ Slow path probability,  $p \ll 1$ Mean time  $\langle T \rangle \approx pT_s + T_f$ 

How slow is this over the fast path (the ceiling)?

$$\frac{\left(pT_S + T_f\right)}{T_f} = 1 + p\left(\frac{T_S}{T_f}\right)$$

How fast is this over the slow path (the floor)?

$$\frac{T_S}{pT_S + T_f} = \frac{1}{p + \frac{T_f}{T_S}}$$

What can we say about the maximum speedup based on Amdahl's law?



# Example from a BCI Processor (SCALO)





### Rewriting Applications to Filter Communication

### Locality sensitive hashing

Euclidean Distance: O(N)

Cross correlation: O(kN)

Dynamic Time Warping:  $O(N^2)$ 

Earth mover's distance: O(N<sup>3</sup>log N)



**Equality check!** 



## Rewriting Applications to Filter Communication





## More ways to Improve Efficiency

### **Exploit technology**

e.g., at 12 nm, area not a key constraint for BCIs.

NVMs are a feasibility

### **Approximate**

Filter, compress, sample, precision, stale data

Predict

Relax synchronization, "correctness" (expectations of correctness)—e.g., HogWild! NeurIPS 2011—but, know how correct you are!

### Locality

Keep data small, keep close

#### Regroup

Batch, when one-time costs are high and non-recurring—e.g., cache data block access

Fragment, when it enables concurrency—
e.g., pipelining, streaming, sharding.

Independent components

Centralized vs distributed



### Takeaways

1-page write-up due in class on 8/272-page project proposals due 9/3

### What is computer architecture?

Historically, the ISA; but now encompasses organization

#### What are the goals?

Mnemonic: Simple Timely Efficient Adaptable Dependable Yummy

### How to estimate impact of fixing bottlenecks?

Amdahl's law

#### How to fix bottlenecks?

Algorithms, adding a fast path

### How to improve efficiency?

Technology, approximation, locality, regroup



# Image Credits (Educational, Fair Use)

- Title image: VLADGRIN, <a href="https://www.istockphoto.com/vector/human\_-machine-gm147409511-16840728">https://www.istockphoto.com/vector/human\_-machine-gm147409511-16840728</a> (Educational fair use)
- Infinite brain: Science wonder stories, May 1930, Illustrator: Frank R Paul, Editor: Hugo Gernsback
- Brain color, ICs, cloud server, black rat: No attribution required (Hiclipart)
- Hand with spoon: public domain freepng
- Signals: <a href="https://www.nature.com/articles/nrn3724">https://www.nature.com/articles/nrn3724</a>
- Thought clouds: F. Willett et al./*Nature* 2021/Erika Woodrum, <a href="https://med.stanford.edu/neurosurgery/news/2022/bci-award">https://med.stanford.edu/neurosurgery/news/2022/bci-award</a>. https://www.the-scientist.com/news-opinion/brain-computer-interface-user-types-90-characters-per-minute-with-mind-68762
- Picture of scientists: <a href="https://www.cs.auckland.ac.nz/~brian/rutherford8.html">https://www.cs.auckland.ac.nz/~brian/rutherford8.html</a> (original: Pierre de Latil), Bush (Carnegie Science), Others (Wikipedia and National Academies)
- Flowchart: Pause08 flaticon.com
- Digital brain: Smashicons flaticon.com
- Server rack: upklyak freepik.com
- Quantum processor icons created by Paul J. Flaticon
- Arm, Lotus: Adobe stock
- Quantum processor: Rigetti computing
- Images of implanted users: Top: Case Western Reserve University (<a href="https://thedaily.case.edu/man-quadriplegia-employs-injury-bridging-technologies-move-just-thinking/">https://thedaily.case.edu/man-quadriplegia-employs-injury-bridging-technologies-move-just-thinking/</a>), Bottom: Jan Scheuermann (University of Pittsburgh/UPMC; <a href="https://www.upmc.com/media/news/bci-press-release-chocolate">https://www.upmc.com/media/news/bci-press-release-chocolate</a>)
- Images of wearable BCIs: Cognizion, NextMind
- Types of BCIs: "Brain-computer interfaces for communication and rehabilitation,
- Illustrative BCI: Neuralink
- Electrodes: "Electrochemical and electrophysiological considerations for clinical high channel count neural interfaces", Vatsyayan et al.
- Form factors: Neuropace, Medtronic, Bloomberg, "Fully Implanted Brain-Computer Interface in a Locked-In Patient with ALS" by Vansteensel et al., Blackrock Neurotech
- Jose Delgado's video: Online, various sources (CNN, Youtube)
- Video of Kennedy and Ramsey: Online, various sources (Youtube, Neural signals)

