Cornell Virtual Workshop > AI with Deep Learning > Resources

Exercise

This exercise is inspired by the Demonstration provided by Killian Weinberger of the Cornell University Department of Computer Science in this video from a 2021 Quantum Matter Summer School. The demo is a small part of a much larger video, and the link above goes to that chapter of the video. But you might want to back up a chapter or two to see Killian's explanation of what the demo is doing. Or you might want to watch the entire thing, since it's very interesting and informative!

As noted in the preceding introductory material, neural networks are basically learning to map inputs to outputs. They are function approximators. And this demo that Killian Weinberger and Paul Ginsparg constructed does a nice job of demonstrating that learning process, in the relatively simple example of a regression to a set of data points in two dimensions. Their demo is much more compelling than the exercise included below — in its use of interactive graphics, real-time model updating, and auxiliary graphics that demonstrate the structure and status of the neural network. But the exercise below is at least a start in creating something along those lines, and perhaps a substrate for you to work with if you aspire to building something like what is shown in the video.

There are a few points worth making about how this exercise differs from the one shown in the video:

This exercise uses TensorFlow and Keras to build a neural network, whereas it appears that the demo in the video encodes the network and its action directly using the numpy library. The architecture of the TensorFlow/Keras network model, however, is intended to mimic that displayed in the video.
This exercise generates a set of points to fit programmatically, rather than allowing the user to click in a plot window and have the points appear.
Because this exercise does not generate animated graphics like in the video, it instead does a sequence of fits, and plots the current fit at regular intervals in the training process. Therefore, there are multiple fitting curves drawn, which presumably improve in their approximation of the data as the fitting progresses, and converge to a best fit during the later stages.
You can download and run the exercise code below, assuming you have the proper software environment configured, which in addition to tensorflow would also require matplotlib for plotting. It's good to run it multiple times, since both the points generated differ randomly from run to run, as do the initial parameters assigned to the edges and nodes of the network. If you run it multiple times, you can see that sometimes it does a good job of fitting the data, and sometimes it does not. (An example of such of fit is shown in the figure below. The demo in the video always seems to do a good job, given a sufficient number of nodes in the hidden layer.) You could experiment with different aspects of the fitting code to see if different choices of optimization algorithms makes a difference in the outcome.

Sample fit to data — The fit of a neural network to a set of data points, plotted at regular intervals en route to converging to a best fit

So feel free to download and run the code below, and consider some of the points raised above. You can modify the code to provide a different number of data points, or to use a different number of nodes in the hidden layer, or whatever. Enjoy!

import numpy as np
import matplotlib
import matplotlib.pyplot as plt
import tensorflow as tf
import math

# set INTERACTIVE=True if you want an interactive matplotlib backend
# set INTERACTIVE=False if you want the non-interactive agg backend, useful for just saving the plot to a file

INTERACTIVE = True

if not INTERACTIVE:
    matplotlib.use('agg')

Nx = 21   # number of data points
Nn = 20   # number of neurons in hidden layer

dx = (10-(-10))/Nx

# Generate some random data points

x = np.linspace(-10, 10, num=Nx) + 0.1*dx*np.random.normal(size=Nx)
y = 10*0.1*x*np.cos(x) + np.random.normal(size=Nx)

# Create a model with one hidden layer

model = tf.keras.Sequential()
model.add(tf.keras.layers.Dense(units = 1, activation = 'linear', input_shape=[1]))
model.add(tf.keras.layers.Dense(units = Nn, activation = 'relu'))
model.add(tf.keras.layers.Dense(units = 1, activation = 'linear'))
model.compile(loss='mse', optimizer="adam")

# Print summary of the model
model.summary()

# Plot the data points

xc = np.linspace(-10., 10., num=100)
plt.figure(figsize=(8,8))
plt.scatter(x[::1], y[::1], s=10, c='k')
plt.grid()

total_ep = 0
last_ep = 0

Npass = 20
nepoc = 2000

# Fit for Npass*nepoc epochs, plotting the current fit after every nepoc epochs

for i in range(Npass):
    model.fit(x, y, epochs=nepoc, verbose=0)
    y_predicted = model.predict(xc)
    plt.plot(xc, y_predicted)

    alpha = 0.3
    width = 1
    if i == Npass-1:
        alpha = 1.0
        width = 2
    plt.plot(xc, y_predicted, label=i, alpha=alpha, linewidth=width)

plt.legend()
ranno = np.random.randint(100000, 999999)
plt.savefig(f'regression1d_fit_{ranno}.png')

if INTERACTIVE:
    plt.show()

Back