第2章 Python 数字图像处理(DIP) --数字图像基础3 - 图像内插 - 最近邻内插 - 双线性插值 - 双三次内插 - 图像放大

转载请注明出处

图像内插

内插通常在图像放大、缩小、旋转和几何校正等任务中使用。内插并用它来调整图像的大小（缩小和放大），缩小和放大基本上采用图像重取样方法

最近邻内插，这种方法将原图像中最近邻的灰度赋给了每个新位置，这种方法简单，但会产生我们不想要的人为失真，如严重的直边失真。更合适的方法是双线性内插，它使用4个最近邻的灰度来计算给定位置的灰度。令 $(x, y)$ 表示待赋灰度值的位置（可将它相像为前面描述的网格点）的坐标，令 $v(x, y)$ 表示灰度值。对于双线性内插方法，所赋的值由如下公式得到：
$v(x, y) = ax + by + cxy + d \tag{2.17}$
4 个系数可由点(x, y)的4个最近邻点写出的4个未知方程求出。双线性内插的结果要比最近邻内插的结果好得多，但计算量会随之增大。

def nearest_neighbor_interpolation(img, new_h, new_w):
    """
    get nearest_neighbor_interpolation for image, can up or down scale image into any ratio
    param: img: input image, grady image, 1 channel, shape like [512, 512]
    param: new_h: new image height 
    param: new_w: new image width
    return a nearest_neighbor_interpolation up or down scale image
    """
    new_img = np.zeros([new_h, new_w])
    src_height, src_width = img.shape[:2]
    r = new_h / src_height
    l = new_w / src_width
    for i in range(new_h):
        for j in range(new_w):
            x0 = int(i / r)
            y0 = int(j / l)
            new_img[i, j] = img[x0, y0]
    return new_img

# 最近邻插值法处理一通道的图像
img = cv2.imread('DIP_Figures/DIP3E_Original_Images_CH06/Fig0638(a)(lenna_RGB).tif', 0)

img_up = nearest_neighbor_interpolation(img, 1000, 1000)

plt.figure(figsize=(16, 8))
plt.subplot(121), plt.imshow(img, 'gray')
plt.subplot(122), plt.imshow(img_up, 'gray')
plt.tight_layout()
plt.show()

在这里插入图片描述

# 最近邻插值法处理RGB 3通过的图像
img = cv2.imread('DIP_Figures/DIP3E_Original_Images_CH06/Fig0638(a)(lenna_RGB).tif', -1)
img = img[..., ::-1]
img_up_1 = nearest_neighbor_interpolation(img[..., 0], 800, 800)
img_up_2 = nearest_neighbor_interpolation(img[..., 1], 800, 800)
img_up_3 = nearest_neighbor_interpolation(img[..., 2], 800, 800)
img_up = np.uint8(np.dstack((img_up_1, img_up_2, img_up_3)))

plt.figure(figsize=(16, 8))
plt.subplot(121), plt.imshow(img, 'gray')
plt.subplot(122), plt.imshow(img_up, 'gray')
plt.tight_layout()
plt.show()

在这里插入图片描述

双线性插值

又称为双线性内挺。在数学上，双线性插值是有两个变量的插值函数的线性插值扩展，其核心思想是在两个方向分别进行一次线性插值。

假设我们想得到未知函数 $f在点 P=(x,y)$ 的值，假设我们已知函数 $f在Q_{11} = (x_1, y_1)，Q_{12}=(x_1, y_2), Q_{21} = (x_2, y_1),Q_{22} = (x_2, y_2)$ 四个点的值。首先在x方向进行线性插值，得到：

$f(R_1) \approx \frac{x_2 - x}{x_2 - x_1} f(Q_{11}) + \frac{x-x_1}{x_2 - x_1} f(Q_{21}), where R_1 = (x, y_1)$
$f(R_2) \approx \frac{x_2 - x}{x_2 - x_1} f(Q_{12}) + \frac{x-x_1}{x_2 - x_1} f(Q_{22}), where R_2 = (x, y_2)$

然后在y方向进行线性插值，得到
$f(P) \approx \frac{y_2 - y}{y_2 - y_1} f(R_{1}) + \frac{y-y_1}{y_2 - y_1} f(R_{2})$

def bilinear_interpolation(img, new_h, new_w):
    """
    get nearest_neighbor_interpolation for image, can up or down scale image into any ratio
    param: img: input image, grady image, 1 channel, shape like [512, 512]
    param: new_h: new image height 
    param: new_w: new image width
    return a nearest_neighbor_interpolation up or down scale image
    """
    src_height, src_width = img.shape[:2]
    if new_h == src_height and new_w == src_width:
        return img.copy()
    
    new_img = np.zeros([new_h, new_w])
    for i in range(new_h):
        for j in range(new_w):
            # 首先要找到在原图中对应点的(x, y)
            x = (i+0.5) * src_height / new_h - 0.5
            y = (j+0.5) * src_width / new_w - 0.5
            
            # find the coordinates of the points which will be used to compute the interpolation
            src_x0 = int(np.floor(x))
            src_x1 = min(src_x0 + 1 , src_height - 1)
            src_y0 = int(np.floor(y))
            src_y1 = min(src_y0 + 1, src_width - 1)

            # calculate the interpolation
            temp0 = (src_x1 - x) * img[src_x0, src_y0] + (x - src_x0) * img[src_x1, src_y0]
            temp1 = (src_x1 - x) * img[src_x0, src_y1] + (x - src_x0) * img[src_x1, src_y1]
            new_img[i, j] = int((src_y1 - y) * temp0 + (y - src_y0) * temp1)
    return new_img

# 最近邻插值法处理一通道的图像
img = cv2.imread('DIP_Figures/DIP3E_Original_Images_CH06/Fig0638(a)(lenna_RGB).tif', 0)

img_up = bilinear_interpolation(img, 800, 800)

plt.figure(figsize=(16, 8))
plt.subplot(121), plt.imshow(img, 'gray')
plt.subplot(122), plt.imshow(img_up, 'gray')
plt.tight_layout()
plt.show()

在这里插入图片描述

双三次内插

双三次插值的目的就是通过找到一种关系，或者说系数，可以把这16个像素对于P处像素值的影响因子找出来，从而根据这个影响因子来获得目标图像对应点的像素值，达到图像缩放的目的。

Bicubic基函数形式如下：
$W(x) =\begin{cases} (a+2) |x|^3 - (a+3)|x|^2 + 1, & |x| \leq 1 \\a|x|^3 -5a|x|^2 + 8a|x| -4a, & 1 < |x| < 2 \\ 0, & \text{otherwise}\end{cases}$

$a=-1$
注 $a=-0.5$ 才能完美的实现内插

插值计算公式：
$\sum_{i=0}^3 \sum_{j=0}^3 a_{ij}x^iy^i$

def bicubic(x):
    """
    BiCubic primary fuction
    """
    x = abs(x)
    a = -0.5
    if x <= 1:
        return (a + 2) * (x**3) - (a + 3) * (x**2) + 1
    elif x < 2:
        return a * (x**3) - 5 * a * (x**2) +  (8 * a * x) - (4 * a)
    else:
        return 0

def bicubic_interpolation(img, new_h, new_w):
    src_height, src_width = img.shape[:2]
    new_img = np.zeros([new_h, new_w])
    for h in range(new_h):
        for w in range(new_w):
            src_x = h * (src_height / new_h)
            src_y = w * (src_width / new_w)
            
            x = int(np.floor(src_x))
            y = int(np.floor(src_y))
            
            u = src_x - x
            v = src_y - y
            temp = 0
            for i in range(-1, 2):
                for j in range(-1, 2):
                    if x + i < 0 or y + j < 0 or x + i >= src_height or y + j >= src_width:
                        continue
                    temp += img[x+i, y+j] * bicubic(i - u) * bicubic(j - v)
            new_img[h, w] = np.clip(temp, 0, 255)
    return new_img

# 最近邻插值法处理一通道的图像
img = cv2.imread('DIP_Figures/DIP3E_Original_Images_CH06/Fig0638(a)(lenna_RGB).tif', 0)

img_up = bicubic_interpolation(img, 800, 800)
# img_up = cv2.resize(img, (800, 800), cv2.INTER_CUBIC)

plt.figure(figsize=(16, 8))
plt.subplot(121), plt.imshow(img, 'gray')
plt.subplot(122), plt.imshow(img_up, 'gray')
plt.tight_layout()
plt.show()

在这里插入图片描述

# 先裁剪图像，然后把裁剪的图像缩小，再进行最近邻、双线内插、双三次内插放大，对比效果
img = cv2.imread('DIP_Figures/DIP3E_Original_Images_CH02/Fig0220(a)(chronometer 3692x2812  2pt25 inch 1250 dpi).tif', 0)

img = img[1100:3500, 200:2600]
img = cv2.resize(img, (200, 200), interpolation=cv2.INTER_CUBIC)

new_h, new_w = 2400, 2400
img_nearest = nearest_neighbor_interpolation(img, new_h, new_w)
img_bilinear = bilinear_interpolation(img, new_h, new_w)
img_bicubic = bicubic_interpolation(img, new_h, new_w)

plt.figure(figsize=(18, 6))
plt.subplot(131), plt.imshow(img_nearest, 'gray'), plt.title('Nearest'), #plt.xticks([]), plt.yticks([])
plt.subplot(132), plt.imshow(img_bilinear, 'gray'), plt.title('Bilinear'), #plt.xticks([]), plt.yticks([])
plt.subplot(133), plt.imshow(img_bicubic, 'gray'), plt.title('Bicubic'), #plt.xticks([]), plt.yticks([])
plt.tight_layout()
plt.show()

在这里插入图片描述

放大图像

def up_sample_2d(img):
    """
    up sample 2d image, if your image is RGB, you could up sample each channel, then use np.dstack to merge a RGB image
    param: input img: it's a 2d gray image
    return a 2x up sample image
    """
    height, width = img.shape[:2]
    temp = np.zeros([height*2, width*2])
    temp[::2, ::2] = img
    temp[1::2, 1::2] = img
    temp[0::2, 1::2] = img
    temp[1::2, 0::2] = img
    return temp

# up sample image using Numpy
img = np.random.random([12, 12])
up = up_sample_2d(img)
down = np.zeros([12, 12])
down = up[::2, ::2]
plt.figure(figsize=(15, 5))
plt.subplot(1,3,1), plt.imshow(img),# plt.xticks([]), plt.yticks([])
plt.subplot(1,3,2), plt.imshow(up),# plt.xticks([]), plt.yticks([])
plt.subplot(1,3,3), plt.imshow(down),# plt.xticks([]), plt.yticks([])
plt.show()

在这里插入图片描述

# 等比2倍放大
img_ori = cv2.imread("DIP_Figures/DIP3E_Original_Images_CH06/Fig0638(a)(lenna_RGB).tif")
img_ori = img_ori[:, :, ::-1]

temp = []
for i in range(img_ori.shape[-1]):
    temp.append(up_sample_2d(img_ori[:, :, i]))
up1 = np.uint8(np.dstack(temp))

temp = []
for i in range(up1.shape[-1]):
    temp.append(up_sample_2d(up1[:, :, i]))
up2 = np.uint8(np.dstack(temp))

plt.figure(figsize=(21, 7))
plt.subplot(1,3,1), plt.imshow(img_ori), plt.title("Original"),# plt.xticks([]), plt.yticks([])
plt.subplot(1,3,2), plt.imshow(up1), plt.title("2X"),# plt.xticks([]), plt.yticks([])
plt.subplot(1,3,3), plt.imshow(up2), plt.title("4X"),# plt.xticks([]), plt.yticks([])
plt.tight_layout()
plt.show()

在这里插入图片描述

最后编辑于：2021.03.01 10:36:23

禁止转载，如需转载请通过简信或评论联系作者。

人面猴
序言：七十年代末，一起剥皮案震惊了整个滨河市，随后出现的几起案子，更是在滨河造成了极大的恐慌，老刑警刘岩，带你破解...
沈念sama阅读 204,293评论 6赞 478
死咒
序言：滨河连续发生了三起死亡事件，死亡现场离奇诡异，居然都是意外死亡，警方通过查阅死者的电脑和手机，发现死者居然都...
沈念sama阅读 85,604评论 2赞 381
救了他两次的神仙让他今天三更去死
文/潘晓璐我一进店门，熙熙楼的掌柜王于贵愁眉苦脸地迎上来，“玉大人，你说我怎么就摊上这事。” “怎么了？”我有些...
开封第一讲书人阅读 150,958评论 0赞 337
道士缉凶录：失踪的卖姜人
文/不坏的土叔我叫张陵，是天一观的道长。经常有香客问我，道长，这世上最难降的妖魔是什么？我笑而不...
开封第一讲书人阅读 54,729评论 1赞 277
港岛之恋（遗憾婚礼）
正文为了忘掉前任，我火速办了婚礼，结果婚礼上，老公的妹妹穿的比我还像新娘。我一直安慰自己，他们只是感情好，可当我...
茶点故事阅读 63,719评论 5赞 366
恶毒庶女顶嫁案：这布局不是一般人想出来的
文/花漫我一把揭开白布。她就那样静静地躺着，像睡着了一般。火红的嫁衣衬着肌肤如雪。梳的纹丝不乱的头发上，一...
开封第一讲书人阅读 48,630评论 1赞 281
城市分裂传说
那天，我揣着相机与录音，去河边找鬼。笑死，一个胖子当着我的面吹牛，可吹牛的内容都是我干的。我是一名探鬼主播，决...
沈念sama阅读 38,000评论 3赞 397
双鸳鸯连环套：你想象不到人心有多黑
文/苍兰香墨我猛地睁开眼，长吁一口气：“原来是场噩梦啊……” “哼！你这毒妇竟也来了？” 一声冷哼从身侧响起，我...
开封第一讲书人阅读 36,665评论 0赞 258
万荣杀人案实录
序言：老挝万荣一对情侣失踪，失踪者是张志新（化名）和其女友刘颖，没想到半个月后，有当地人在树林里发现了一具尸体，经...
沈念sama阅读 40,909评论 1赞 299
护林员之死
正文独居荒郊野岭守林人离奇死亡，尸身上长有42处带血的脓包…… 初始之章·张勋以下内容为张勋视角年9月15日...
茶点故事阅读 35,646评论 2赞 321
白月光启示录
正文我和宋清朗相恋三年，在试婚纱的时候发现自己被绿了。大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
茶点故事阅读 37,726评论 1赞 330
活死人
序言：一个原本活蹦乱跳的男人离奇死亡，死状恐怖，灵堂内的尸体忽然破棺而出，到底是诈尸还是另有隐情，我是刑警宁泽，带...
沈念sama阅读 33,400评论 4赞 321
日本核电站爆炸内幕
正文年R本政府宣布，位于F岛的核电站，受9级特大地震影响，放射性物质发生泄漏。R本人自食恶果不足惜，却给世界环境...
茶点故事阅读 38,986评论 3赞 307
男人毒药：我在死后第九天来索命
文/蒙蒙一、第九天我趴在偏房一处隐蔽的房顶上张望。院中可真热闹，春花似锦、人声如沸。这庄子的主人今日做“春日...
开封第一讲书人阅读 29,959评论 0赞 19
一桩弑父案，背后竟有这般阴谋
文/苍兰香墨我抬头看了看天上的太阳。三九已至，却和暖如春，着一层夹袄步出监牢的瞬间，已是汗流浃背。一阵脚步声响...
开封第一讲书人阅读 31,197评论 1赞 260
情欲美人皮
我被黑心中介骗来泰国打工，没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留，地道东北人。一个月前我还...
沈念sama阅读 44,996评论 2赞 349
代替公主和亲
正文我出身青楼，却偏偏与公主长得像，于是被迫代替她去往敌国和亲。传闻我的和亲对象是个残疾皇子，可洞房花烛夜当晚...
茶点故事阅读 42,481评论 2赞 342

第2章 Python 数字图像处理(DIP) --数字图像基础3 - 图像内插 - 最近邻内插 - 双线性插值 - 双三次内插 - 图像放大

图像内插

放大图像

推荐阅读更多精彩内容