CNN中——PILImage和tensor转换

Post author:xfxia
Post published:2023年9月11日
Post category:其他

PILImage和tensor转换

在我们使用图片训练时，或者使用图片进行预测时，我们都需要过一遍net。

然而图片形式是PILImage的格式。

net需要的时tensor格式的图片。并且形式为（channel, height, width）

我们需要进行转换。

转换的方法有两种。

一、使用numpy->tensor

二、使用PILImage->tensor

使用到两个依赖包：

torch.from_numpy,

torchvision.transforms.

numpy->tensor

PILImage->numpy->tensor

image = Image.open(image_path)  # 读取图片
image = np.array(image, dtype=np.float32)  # PILImage->numpy 输出(h,w,c)
image /= 255.0  # 网络输入需要归一化
image = np.transpos	e(image, (2, 0, 1))  # np下维度转换使用transpose
image = torch.from_numpy(image)  # numpy->tensor
print(image.shape)

使用cv读：

image = cv2.imread(image_path)  # 使用cv读直接为numpy格式
image = image.astype(np.float32)
image /= 255.0
image = np.transpose(image, (2, 0, 1))
image = torch.from_numpy(image)

输出

torch.Size([3, 575, 551])  # (c,h,w)

PILImage->tensor

image = Image.open(image_path)  # 读取图片
image = transforms.ToTensor()(image)  # PILImage->tensor
# image = torch.Tensor.permute(image, (0, 1, 2))  # tensor下维度转换
print(image.shape)

使用torchvision下transforms依赖包，一步到位。

在第二行中，集成处理了以下步骤：

1，img.tobytes() 将图片转化成内存中的存储格式

2，torch.BytesStorage.frombuffer(img.tobytes() ) 将字节以流的形式输入，转化成一维的张量

3，对张量进行reshape

4，对张量进行permute（2,0,1）

5，将当前张量的每个元素除以255

6，输出张量

第三行备注的那句，是在tensor下如果想要转换维度使用的代码。跟之前numpy下不同。

若要将tensor->PILImage

使用相反的：

image = transforms.ToPILImage()(image)

结束~

原文链接：https://blog.csdn.net/giganticpower/article/details/108195052

PILImage和tensor转换

numpy->tensor

PILImage->tensor

结束~

你可能也喜欢