3.2. Broadcasting Examples

General broadcasting rules (taken directly from the NumPy documentation)

When operating on two arrays, NumPy compares their shapes element-wise. It starts with the trailing (i.e. rightmost) dimension and works its way left. Two dimensions are compatible when

they are equal, or
one of them is 1.
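As a quick check of these rules, the sketch below uses np.broadcast_shapes (available in NumPy 1.20 and later) to ask what shape two arrays would broadcast to, without building the arrays. The shapes are chosen only for illustration and do not come from the examples in this section.

```python
import numpy as np

# Compatible: comparing right to left, each pair of dimensions contains a 1.
shape_ok = np.broadcast_shapes((1, 4), (3, 1))
print(shape_ok)  # (3, 4)

# A missing leading dimension is treated as if it were 1.
shape_pad = np.broadcast_shapes((5, 3), (3,))
print(shape_pad)  # (5, 3)

# Incompatible: the trailing dimensions 4 and 3 differ and neither is 1.
try:
    np.broadcast_shapes((4,), (3,))
    compatible = True
except ValueError:
    compatible = False
print(compatible)  # False
```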

Here is an example. Generate two arrays \(a\) and \(b\), of sizes 4 and 3, respectively, using NumPy’s random.randint() function. Subtract each element of \(b\) from each element of \(a\), so that the result contains every pairwise difference \(a_j - b_i\). The solution is to create a new “dummy axis” for each array, giving shapes (1, 4) and (3, 1), which broadcast to (3, 4). Here is the code using broadcasting, compared with the same result computed using loops.

import numpy as np

a = np.random.randint(0,10, 4)
print(a) 
b = np.random.randint(0,10, 3) 
print(b) 

[0 9 8 8]
[7 2 2]

a_new = a[np.newaxis,:]
b_new = b[:, np.newaxis] 
result = a_new - b_new
result

array([[-7,  2,  1,  1],
       [-2,  7,  6,  6],
       [-2,  7,  6,  6]])

# compare result to solution via loop
# (np.zeros creates a float array, hence the decimal points in the output)
c = np.zeros((b.size, a.size))
for i in range(b.size): 
    for j in range(a.size): 
        c[i,j] = a[j] - b[i]
c

array([[-7.,  2.,  1.,  1.],
       [-2.,  7.,  6.,  6.],
       [-2.,  7.,  6.,  6.]])
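To confirm that the two approaches agree, a sketch like the following can be used. It hard-codes the sample values printed above so that the output is reproducible. It also shows np.subtract.outer as an alternative route to the same array; that alternative is not part of the original example.

```python
import numpy as np

# Sample values copied from the printed output above (fixed for reproducibility).
a = np.array([0, 9, 8, 8])
b = np.array([7, 2, 2])

# Broadcasting: (1, 4) - (3, 1) -> (3, 4), entry [i, j] = a[j] - b[i].
result = a[np.newaxis, :] - b[:, np.newaxis]

# Explicit double loop for comparison.
c = np.zeros((b.size, a.size))
for i in range(b.size):
    for j in range(a.size):
        c[i, j] = a[j] - b[i]

print(np.array_equal(result, c))  # True

# Alternative (not from the original text): outer subtraction, then transpose.
alt = np.subtract.outer(a, b).T
print(np.array_equal(result, alt))  # True
```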

Here is a second broadcasting example; it comes from a common machine learning algorithm.

Suppose \(X\) is an \(n \times m\) array and \(c\) is an \(l \times m\) array. I would like to subtract all \(n\) rows of \(X\) from all \(l\) rows of \(c\). How can I do that? Below I generate random data and show how to do this utilizing broadcasting.

To generate data, I will use the NumPy random.rand() function, which generates uniformly distributed random numbers in the interval [0, 1).

# shapes of the arrays
n=15
l=4
m=3

# data for X

X = np.random.rand(n,m)
print(X)
print(X.shape) 

[[0.01780382 0.18167292 0.46624413]
 [0.01995624 0.75470246 0.88746441]
 [0.20557532 0.15009887 0.74720668]
 [0.54580998 0.44169049 0.14994426]
 [0.21565274 0.25119432 0.62826228]
 [0.72950583 0.36981018 0.61794542]
 [0.40151773 0.99680333 0.65861836]
 [0.18626181 0.43733894 0.7520234 ]
 [0.70220766 0.11191155 0.46925418]
 [0.45410919 0.4293809  0.01132849]
 [0.58821175 0.25254319 0.05853927]
 [0.5612542  0.69062168 0.42842617]
 [0.47167694 0.24370346 0.24385397]
 [0.24659046 0.96082418 0.50899194]
 [0.17179814 0.62224707 0.76429367]]
(15, 3)

# data for c

c = np.random.rand(l,m)
print(c)
print(c.shape) 

[[0.51571888 0.71021182 0.56532302]
 [0.03290376 0.51387757 0.17119873]
 [0.9969326  0.72860408 0.87767288]
 [0.24907407 0.12274001 0.2735593 ]]
(4, 3)

To subtract the two, I need to create a new axis for each array. The goal is to subtract every row of \(X\) from every row of \(c\), producing all \(n \times l\) pairwise differences. The solution is to use NumPy broadcasting by adding an extra dimension to each array:

Reshape \(X\) from (n,m) to (n,1,m) - adds a “dummy” dimension
Reshape \(c\) from (l,m) to (1,l,m) - adds a “dummy” dimension

NumPy then automatically “broadcasts” (conceptually repeats, without copying data) the arrays along the dummy dimensions:

X gets repeated l times along the middle dimension
c gets repeated n times along the first dimension
Result is (n,l,m) where entry [i,j,:] = c[j,:] - X[i,:]
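The “repetition” described here is conceptual: NumPy reuses the existing memory rather than making copies. A minimal sketch using np.broadcast_to illustrates this. The names X_b and c_b are introduced only for this illustration.

```python
import numpy as np

n, l, m = 15, 4, 3
X = np.random.rand(n, m)
c = np.random.rand(l, m)

# Explicitly broadcast both arrays to the common shape (n, l, m).
X_b = np.broadcast_to(X[:, np.newaxis, :], (n, l, m))
c_b = np.broadcast_to(c[np.newaxis, :, :], (n, l, m))

print(X_b.shape, c_b.shape)  # (15, 4, 3) (15, 4, 3)

# A stride of 0 along an axis means the same memory is reused there,
# so the "repeated" rows are read-only views rather than copies.
print(X_b.strides[1], c_b.strides[0])  # 0 0
print(np.shares_memory(X, X_b))        # True
```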

X_new = X[:, np.newaxis, :]
X_new.shape

(15, 1, 3)

c_new = c[np.newaxis, :,:]
c_new.shape

(1, 4, 3)

# c_new (1,l,m) minus X_new (n,1,m): every row of X is subtracted from every row of c
result = c_new - X_new

result.shape

(15, 4, 3)
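As a sanity check, the broadcast result can be compared with an explicit double loop, following the convention stated above that entry [i,j,:] equals c[j,:] - X[i,:]. This is a sketch with freshly generated data; the seed and the name expected are assumptions added for illustration.

```python
import numpy as np

np.random.seed(0)  # arbitrary seed, chosen only to make the check repeatable

n, l, m = 15, 4, 3
X = np.random.rand(n, m)
c = np.random.rand(l, m)

# Broadcasting: (1, l, m) - (n, 1, m) -> (n, l, m).
result = c[np.newaxis, :, :] - X[:, np.newaxis, :]

# Loop version: every row of X subtracted from every row of c.
expected = np.zeros((n, l, m))
for i in range(n):
    for j in range(l):
        expected[i, j, :] = c[j, :] - X[i, :]

print(result.shape)                   # (15, 4, 3)
print(np.allclose(result, expected))  # True
```

In many machine learning settings a natural next step would be to reduce over the last axis, for example (result ** 2).sum(axis=2) for the squared Euclidean distance between every pair of rows. The text does not say which algorithm it has in mind, so this is only one possibility.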