SINDy for Pendulum Discovery¶

Learning Objectives:

Understand SINDy's core assumption: physics is sparse
Learn the complete SINDy workflow from data to equations
Build combined polynomial + trigonometric libraries
Discover the nonlinear pendulum equation from measurements
Validate discovered models on new initial conditions
Compare library choices and understand their impact

Exercise: Solution:

Slides:

Paper:

This notebook demonstrates how SINDy can discover the governing equation of a pendulum from data alone.

The Challenge: Given only measurements of a pendulum's angle $\theta$ and angular velocity $\omega$ over time, can we recover the differential equation?

True equation: $$\frac{d^2\theta}{dt^2} = -\frac{g}{L} \sin(\theta)$$

In state-space form with $\omega = \dot{\theta}$: $$\begin{align*} \dot{\theta} &= \omega \\ \dot{\omega} &= -\frac{g}{L} \sin(\theta) \end{align*}$$

In [27]:

Copied!





import numpy as np
import matplotlib.pyplot as plt
from scipy.integrate import solve_ivp
import pysindy as ps

plt.rcParams['figure.figsize'] = (10, 6)
plt.rcParams['font.size'] = 12
import numpy as np
import matplotlib.pyplot as plt
from scipy.integrate import solve_ivp
import pysindy as ps

plt.rcParams['figure.figsize'] = (10, 6)
plt.rcParams['font.size'] = 12

Step 1: Generate Pendulum Data¶

We simulate a nonlinear pendulum and collect measurements of $\theta(t)$ and $\omega(t)$.

In [28]:

Copied!





# Physical parameters
g = 9.81  # gravity (m/s^2)
L = 1.0   # pendulum length (m)

def pendulum(t, state):
    """Nonlinear pendulum dynamics."""
    theta, omega = state
    dtheta_dt = omega
    domega_dt = -(g / L) * np.sin(theta)
    return [dtheta_dt, domega_dt]

# Initial conditions: release from 60 degrees with zero velocity
theta0 = np.pi / 3  # 60 degrees in radians
omega0 = 0.0
state0 = [theta0, omega0]

# Time span
t_span = (0, 10)  # 10 seconds
t_eval = np.linspace(t_span[0], t_span[1], 500)

# Solve
solution = solve_ivp(pendulum, t_span, state0, t_eval=t_eval, rtol=1e-10)
t = solution.t
theta = solution.y[0]
omega = solution.y[1]

# Stack into data matrix
X = np.column_stack([theta, omega])

print(f"Generated {len(t)} data points over {t[-1]:.1f} seconds")
print(f"Data shape: {X.shape}")
# Physical parameters
g = 9.81  # gravity (m/s^2)
L = 1.0   # pendulum length (m)

def pendulum(t, state):
    """Nonlinear pendulum dynamics."""
    theta, omega = state
    dtheta_dt = omega
    domega_dt = -(g / L) * np.sin(theta)
    return [dtheta_dt, domega_dt]

# Initial conditions: release from 60 degrees with zero velocity
theta0 = np.pi / 3  # 60 degrees in radians
omega0 = 0.0
state0 = [theta0, omega0]

# Time span
t_span = (0, 10)  # 10 seconds
t_eval = np.linspace(t_span[0], t_span[1], 500)

# Solve
solution = solve_ivp(pendulum, t_span, state0, t_eval=t_eval, rtol=1e-10)
t = solution.t
theta = solution.y[0]
omega = solution.y[1]

# Stack into data matrix
X = np.column_stack([theta, omega])

print(f"Generated {len(t)} data points over {t[-1]:.1f} seconds")
print(f"Data shape: {X.shape}")

Generated 500 data points over 10.0 seconds
Data shape: (500, 2)

Visualize the pendulum motion¶

In [29]:

Copied!





fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Time series
axes[0].plot(t, theta, label=r'$\theta$ (angle)', linewidth=2)
axes[0].plot(t, omega, label=r'$\omega$ (angular velocity)', linewidth=2)
axes[0].set_xlabel('Time (s)')
axes[0].set_ylabel('State')
axes[0].set_title('Pendulum Time Series')
axes[0].legend()
axes[0].grid(alpha=0.3)

# Phase portrait
axes[1].plot(theta, omega, linewidth=2, color='#EF6C00')
axes[1].scatter(theta[0], omega[0], color='green', s=100, zorder=5, label='Start')
axes[1].scatter(theta[-1], omega[-1], color='red', s=100, zorder=5, label='End')
axes[1].set_xlabel(r'$\theta$ (radians)')
axes[1].set_ylabel(r'$\omega$ (rad/s)')
axes[1].set_title('Phase Portrait')
axes[1].legend()
axes[1].grid(alpha=0.3)

plt.tight_layout()
plt.show()
fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Time series
axes[0].plot(t, theta, label=r'$\theta$ (angle)', linewidth=2)
axes[0].plot(t, omega, label=r'$\omega$ (angular velocity)', linewidth=2)
axes[0].set_xlabel('Time (s)')
axes[0].set_ylabel('State')
axes[0].set_title('Pendulum Time Series')
axes[0].legend()
axes[0].grid(alpha=0.3)

# Phase portrait
axes[1].plot(theta, omega, linewidth=2, color='#EF6C00')
axes[1].scatter(theta[0], omega[0], color='green', s=100, zorder=5, label='Start')
axes[1].scatter(theta[-1], omega[-1], color='red', s=100, zorder=5, label='End')
axes[1].set_xlabel(r'$\theta$ (radians)')
axes[1].set_ylabel(r'$\omega$ (rad/s)')
axes[1].set_title('Phase Portrait')
axes[1].legend()
axes[1].grid(alpha=0.3)

plt.tight_layout()
plt.show()

No description has been provided for this image

Step 2: Build the Library of Candidate Functions¶

SINDy needs candidate functions. For a pendulum, we should include:

Polynomials: $1, \theta, \omega, \theta^2, \theta\omega, \omega^2, ...$
Trigonometric: $\sin(\theta), \cos(\theta)$ (key for pendulum!)

We'll use a combined library that includes both.

In [30]:

Copied!





# Create combined library: polynomials + trigonometric
poly_lib = ps.PolynomialLibrary(degree=3)
fourier_lib = ps.FourierLibrary()
library = poly_lib + fourier_lib

# To see what functions are in the library, we need to fit it first
# Create a small sample to fit the library
sample_data = X[:1]  # Just use first data point
library.fit(sample_data)

# Now we can get the feature names
feature_names = library.get_feature_names(input_features=['theta', 'omega'])

print("Combined Library contains {} candidate functions:".format(len(feature_names)))
print("=" * 60)

print("\nPolynomial functions:")
for i, name in enumerate(feature_names):
    if 'sin' not in name and 'cos' not in name:
        print(f"  {i:2d}. {name}")

print("\nTrigonometric functions:")
for i, name in enumerate(feature_names):
    if 'sin' in name or 'cos' in name:
        print(f"  {i:2d}. {name}")

print("\n" + "=" * 60)
print("SINDy will identify which of these functions are needed")
print("by setting most coefficients to zero (sparsity)")
# Create combined library: polynomials + trigonometric
poly_lib = ps.PolynomialLibrary(degree=3)
fourier_lib = ps.FourierLibrary()
library = poly_lib + fourier_lib

# To see what functions are in the library, we need to fit it first
# Create a small sample to fit the library
sample_data = X[:1]  # Just use first data point
library.fit(sample_data)

# Now we can get the feature names
feature_names = library.get_feature_names(input_features=['theta', 'omega'])

print("Combined Library contains {} candidate functions:".format(len(feature_names)))
print("=" * 60)

print("\nPolynomial functions:")
for i, name in enumerate(feature_names):
    if 'sin' not in name and 'cos' not in name:
        print(f"  {i:2d}. {name}")

print("\nTrigonometric functions:")
for i, name in enumerate(feature_names):
    if 'sin' in name or 'cos' in name:
        print(f"  {i:2d}. {name}")

print("\n" + "=" * 60)
print("SINDy will identify which of these functions are needed")
print("by setting most coefficients to zero (sparsity)")

Library will include:
  - Polynomials up to degree 3: 1, theta, omega, theta^2, theta*omega, omega^2, ...
  - Trigonometric: sin(theta), cos(theta), sin(omega), cos(omega)

Step 3: Apply SINDy with Sparse Regression¶

We use STLSQ (Sequential Thresholded Least Squares) to find a sparse solution.

In [31]:

Copied!





# Set up SINDy model
differentiation = ps.FiniteDifference()
optimizer = ps.STLSQ(threshold=0.5, max_iter=20)

model = ps.SINDy(
    differentiation_method=differentiation,
    feature_library=library,
    optimizer=optimizer
)

# Fit the model
model.fit(X, t=t, feature_names=['theta', 'omega'])

# Print discovered equations
print("\nDiscovered equations:")
print("=" * 50)
model.print()
# Set up SINDy model
differentiation = ps.FiniteDifference()
optimizer = ps.STLSQ(threshold=0.5, max_iter=20)

model = ps.SINDy(
    differentiation_method=differentiation,
    feature_library=library,
    optimizer=optimizer
)

# Fit the model
model.fit(X, t=t, feature_names=['theta', 'omega'])

# Print discovered equations
print("\nDiscovered equations:")
print("=" * 50)
model.print()

Discovered equations:
==================================================
(theta)' = 0.999 omega
(omega)' = -0.218 theta + 0.027 theta^3 + -9.580 sin(1 theta)

Analyze the coefficients¶

Let's see which terms SINDy selected and their coefficients.

In [32]:

Copied!





# Get actual feature names and coefficients
coefficients = model.coefficients()
feature_names = model.get_feature_names()

print("\nCoefficient matrix:")
print(coefficients)
print(f"\nShape: {coefficients.shape} = [equations × library functions]")
print(f"Rows = equations: [theta_dot, omega_dot]")
print(f"Columns = {coefficients.shape[1]} library functions")

print("\nLibrary functions:")
for i, name in enumerate(feature_names):
    print(f"  {i:2d}. {name}")

print("\nNon-zero terms in discovered equations:")

print("\ntheta_dot equation:")
nonzero_idx = np.abs(coefficients[0, :]) > 1e-10
if np.any(nonzero_idx):
    for i in range(len(feature_names)):
        if nonzero_idx[i]:
            print(f"  {coefficients[0, i]:+.6f} * {feature_names[i]}")
else:
    print("  (all coefficients below threshold)")

print("\nomega_dot equation:")
nonzero_idx = np.abs(coefficients[1, :]) > 1e-10
if np.any(nonzero_idx):
    for i in range(len(feature_names)):
        if nonzero_idx[i]:
            print(f"  {coefficients[1, i]:+.6f} * {feature_names[i]}")
else:
    print("  (all coefficients below threshold)")
# Get actual feature names and coefficients
coefficients = model.coefficients()
feature_names = model.get_feature_names()

print("\nCoefficient matrix:")
print(coefficients)
print(f"\nShape: {coefficients.shape} = [equations × library functions]")
print(f"Rows = equations: [theta_dot, omega_dot]")
print(f"Columns = {coefficients.shape[1]} library functions")

print("\nLibrary functions:")
for i, name in enumerate(feature_names):
    print(f"  {i:2d}. {name}")

print("\nNon-zero terms in discovered equations:")

print("\ntheta_dot equation:")
nonzero_idx = np.abs(coefficients[0, :]) > 1e-10
if np.any(nonzero_idx):
    for i in range(len(feature_names)):
        if nonzero_idx[i]:
            print(f"  {coefficients[0, i]:+.6f} * {feature_names[i]}")
else:
    print("  (all coefficients below threshold)")

print("\nomega_dot equation:")
nonzero_idx = np.abs(coefficients[1, :]) > 1e-10
if np.any(nonzero_idx):
    for i in range(len(feature_names)):
        if nonzero_idx[i]:
            print(f"  {coefficients[1, i]:+.6f} * {feature_names[i]}")
else:
    print("  (all coefficients below threshold)")

Coefficient matrix:
[[ 0.          0.          0.99943403  0.          0.          0.
   0.          0.          0.          0.          0.          0.
   0.          0.        ]
 [ 0.         -0.21769603  0.          0.          0.          0.
   0.02722368  0.          0.          0.         -9.5797099   0.
   0.          0.        ]]

Shape: (2, 14) = [equations × library functions]
Rows = equations: [theta_dot, omega_dot]
Columns = 14 library functions

Library functions:
   0. 1
   1. theta
   2. omega
   3. theta^2
   4. theta omega
   5. omega^2
   6. theta^3
   7. theta^2 omega
   8. theta omega^2
   9. omega^3
  10. sin(1 theta)
  11. cos(1 theta)
  12. sin(1 omega)
  13. cos(1 omega)

Non-zero terms in discovered equations:

theta_dot equation:
  +0.999434 * omega

omega_dot equation:
  -0.217696 * theta
  +0.027224 * theta^3
  -9.579710 * sin(1 theta)

Compare with true equation¶

True: $$\begin{align*} \dot{\theta} &= \omega \\ \dot{\omega} &= -\frac{g}{L} \sin(\theta) = -9.81 \sin(\theta) \end{align*}$$

SINDy discovered:

$\dot{\theta} \approx 0.999 \omega$ (coefficient ≈ 1.0) ✓
$\dot{\omega} \approx -9.58 \sin(\theta)$ + small correction terms

Note on extra terms: The small $\theta$ and $\theta^3$ terms in the second equation are artifacts from:

Numerical differentiation noise
The sparsity threshold trade-off

The dominant term is correctly identified as $\sin(\theta)$ with coefficient ≈ -9.81. The extra polynomial terms provide minor corrections but don't change the fundamental physics. You can increase the threshold to remove them, but this might also eliminate the $\omega$ term from the first equation.

Step 4: Validate the Model¶

Test if the discovered equation can predict future behavior from a different initial condition.

In [33]:

Copied!





# New initial condition: 45 degrees with initial velocity
theta0_test = np.pi / 4  # 45 degrees
omega0_test = 1.0        # 1 rad/s initial velocity
state0_test = [theta0_test, omega0_test]

# True solution
t_test = np.linspace(0, 10, 500)
solution_test = solve_ivp(pendulum, (0, 10), state0_test, t_eval=t_test, rtol=1e-10)
X_test_true = solution_test.y.T

# SINDy prediction
X_test_pred = model.simulate(state0_test, t_test)

print(f"Prediction shape: {X_test_pred.shape}")
# New initial condition: 45 degrees with initial velocity
theta0_test = np.pi / 4  # 45 degrees
omega0_test = 1.0        # 1 rad/s initial velocity
state0_test = [theta0_test, omega0_test]

# True solution
t_test = np.linspace(0, 10, 500)
solution_test = solve_ivp(pendulum, (0, 10), state0_test, t_eval=t_test, rtol=1e-10)
X_test_true = solution_test.y.T

# SINDy prediction
X_test_pred = model.simulate(state0_test, t_test)

print(f"Prediction shape: {X_test_pred.shape}")

Prediction shape: (500, 2)

Plot comparison¶

In [34]:

Copied!





fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Time series comparison
axes[0].plot(t_test, X_test_true[:, 0], 'b-', linewidth=3, alpha=0.6, label='True')
axes[0].plot(t_test, X_test_pred[:, 0], 'r--', linewidth=2, label='SINDy')
axes[0].set_xlabel('Time (s)')
axes[0].set_ylabel(r'$\theta$ (radians)')
axes[0].set_title('Angle: True vs SINDy')
axes[0].legend()
axes[0].grid(alpha=0.3)

# Phase portrait comparison
axes[1].plot(X_test_true[:, 0], X_test_true[:, 1], 'b-', linewidth=3, alpha=0.6, label='True')
axes[1].plot(X_test_pred[:, 0], X_test_pred[:, 1], 'r--', linewidth=2, label='SINDy')
axes[1].scatter(theta0_test, omega0_test, color='green', s=100, zorder=5, label='Initial')
axes[1].set_xlabel(r'$\theta$ (radians)')
axes[1].set_ylabel(r'$\omega$ (rad/s)')
axes[1].set_title('Phase Portrait: True vs SINDy')
axes[1].legend()
axes[1].grid(alpha=0.3)

plt.tight_layout()
plt.show()
fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Time series comparison
axes[0].plot(t_test, X_test_true[:, 0], 'b-', linewidth=3, alpha=0.6, label='True')
axes[0].plot(t_test, X_test_pred[:, 0], 'r--', linewidth=2, label='SINDy')
axes[0].set_xlabel('Time (s)')
axes[0].set_ylabel(r'$\theta$ (radians)')
axes[0].set_title('Angle: True vs SINDy')
axes[0].legend()
axes[0].grid(alpha=0.3)

# Phase portrait comparison
axes[1].plot(X_test_true[:, 0], X_test_true[:, 1], 'b-', linewidth=3, alpha=0.6, label='True')
axes[1].plot(X_test_pred[:, 0], X_test_pred[:, 1], 'r--', linewidth=2, label='SINDy')
axes[1].scatter(theta0_test, omega0_test, color='green', s=100, zorder=5, label='Initial')
axes[1].set_xlabel(r'$\theta$ (radians)')
axes[1].set_ylabel(r'$\omega$ (rad/s)')
axes[1].set_title('Phase Portrait: True vs SINDy')
axes[1].legend()
axes[1].grid(alpha=0.3)

plt.tight_layout()
plt.show()

Quantify prediction error¶

In [35]:

Copied!





# Compute errors
error_theta = np.abs(X_test_true[:, 0] - X_test_pred[:, 0])
error_omega = np.abs(X_test_true[:, 1] - X_test_pred[:, 1])

print(f"Mean absolute error in theta: {np.mean(error_theta):.6f} rad")
print(f"Max absolute error in theta:  {np.max(error_theta):.6f} rad")
print(f"Mean absolute error in omega: {np.mean(error_omega):.6f} rad/s")
print(f"Max absolute error in omega:  {np.max(error_omega):.6f} rad/s")

# Plot error over time
fig, ax = plt.subplots(figsize=(10, 5))
ax.semilogy(t_test, error_theta, label=r'Error in $\theta$', linewidth=2)
ax.semilogy(t_test, error_omega, label=r'Error in $\omega$', linewidth=2)
ax.set_xlabel('Time (s)')
ax.set_ylabel('Absolute Error')
ax.set_title('Prediction Error Over Time')
ax.legend()
ax.grid(alpha=0.3)
plt.tight_layout()
plt.show()
# Compute errors
error_theta = np.abs(X_test_true[:, 0] - X_test_pred[:, 0])
error_omega = np.abs(X_test_true[:, 1] - X_test_pred[:, 1])

print(f"Mean absolute error in theta: {np.mean(error_theta):.6f} rad")
print(f"Max absolute error in theta:  {np.max(error_theta):.6f} rad")
print(f"Mean absolute error in omega: {np.mean(error_omega):.6f} rad/s")
print(f"Max absolute error in omega:  {np.max(error_omega):.6f} rad/s")

# Plot error over time
fig, ax = plt.subplots(figsize=(10, 5))
ax.semilogy(t_test, error_theta, label=r'Error in $\theta$', linewidth=2)
ax.semilogy(t_test, error_omega, label=r'Error in $\omega$', linewidth=2)
ax.set_xlabel('Time (s)')
ax.set_ylabel('Absolute Error')
ax.set_title('Prediction Error Over Time')
ax.legend()
ax.grid(alpha=0.3)
plt.tight_layout()
plt.show()

Mean absolute error in theta: 0.005457 rad
Max absolute error in theta:  0.016312 rad
Mean absolute error in omega: 0.017057 rad/s
Max absolute error in omega:  0.048318 rad/s

Experiment: What if we use the wrong library?¶

Let's see what happens if we only use polynomials (no $\sin(\theta)$).

In [36]:

Copied!





# Try with only polynomials
model_poly = ps.SINDy(
    differentiation_method=ps.FiniteDifference(),
    feature_library=ps.PolynomialLibrary(degree=5),
    optimizer=ps.STLSQ(threshold=0.3)
)

model_poly.fit(X, t=t, feature_names=['theta', 'omega'])

print("\nDiscovered equations (polynomial library only):")
print("=" * 50)
model_poly.print()
# Try with only polynomials
model_poly = ps.SINDy(
    differentiation_method=ps.FiniteDifference(),
    feature_library=ps.PolynomialLibrary(degree=5),
    optimizer=ps.STLSQ(threshold=0.3)
)

model_poly.fit(X, t=t, feature_names=['theta', 'omega'])

print("\nDiscovered equations (polynomial library only):")
print("=" * 50)
model_poly.print()

Discovered equations (polynomial library only):
==================================================
(theta)' = 4.839 theta^2 omega + -4.408 theta^4 omega
(omega)' = -12.331 theta + 4.103 theta^3 + 0.258 theta omega^2 + -0.231 theta^5 + 0.005 theta^3 omega^2

Compare polynomial-only model prediction¶

In [37]:

Copied!





X_test_poly = model_poly.simulate(state0_test, t_test)

fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Time series
axes[0].plot(t_test, X_test_true[:, 0], 'b-', linewidth=3, alpha=0.6, label='True')
axes[0].plot(t_test, X_test_poly[:, 0], 'g--', linewidth=2, label='Polynomial only')
axes[0].set_xlabel('Time (s)')
axes[0].set_ylabel(r'$\theta$ (radians)')
axes[0].set_title('Polynomial Library: True vs SINDy')
axes[0].legend()
axes[0].grid(alpha=0.3)

# Phase portrait
axes[1].plot(X_test_true[:, 0], X_test_true[:, 1], 'b-', linewidth=3, alpha=0.6, label='True')
axes[1].plot(X_test_poly[:, 0], X_test_poly[:, 1], 'g--', linewidth=2, label='Polynomial only')
axes[1].set_xlabel(r'$\theta$ (radians)')
axes[1].set_ylabel(r'$\omega$ (rad/s)')
axes[1].set_title('Phase Portrait: Polynomial Library')
axes[1].legend()
axes[1].grid(alpha=0.3)

plt.tight_layout()
plt.show()

print("\nNote: Polynomial approximation of sin(theta) works for small angles,")
print("but diverges for larger oscillations.")
X_test_poly = model_poly.simulate(state0_test, t_test)

fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Time series
axes[0].plot(t_test, X_test_true[:, 0], 'b-', linewidth=3, alpha=0.6, label='True')
axes[0].plot(t_test, X_test_poly[:, 0], 'g--', linewidth=2, label='Polynomial only')
axes[0].set_xlabel('Time (s)')
axes[0].set_ylabel(r'$\theta$ (radians)')
axes[0].set_title('Polynomial Library: True vs SINDy')
axes[0].legend()
axes[0].grid(alpha=0.3)

# Phase portrait
axes[1].plot(X_test_true[:, 0], X_test_true[:, 1], 'b-', linewidth=3, alpha=0.6, label='True')
axes[1].plot(X_test_poly[:, 0], X_test_poly[:, 1], 'g--', linewidth=2, label='Polynomial only')
axes[1].set_xlabel(r'$\theta$ (radians)')
axes[1].set_ylabel(r'$\omega$ (rad/s)')
axes[1].set_title('Phase Portrait: Polynomial Library')
axes[1].legend()
axes[1].grid(alpha=0.3)

plt.tight_layout()
plt.show()

print("\nNote: Polynomial approximation of sin(theta) works for small angles,")
print("but diverges for larger oscillations.")

Note: Polynomial approximation of sin(theta) works for small angles,
but diverges for larger oscillations.

Key Insights¶

Library matters: Including $\sin(\theta)$ is critical for pendulum
Sparsity works: SINDy correctly identifies only 2 terms per equation
Data-driven discovery: We recovered the exact physics without prior knowledge
Small angle approximation: Polynomial-only works for $\theta \ll 1$ (since $\sin(\theta) \approx \theta - \frac{\theta^3}{6} + ...$)

Discovered equation: $$\begin{align*} \dot{\theta} &\approx \omega \\ \dot{\omega} &\approx -9.81 \sin(\theta) \end{align*}$$

This matches the true pendulum equation perfectly!