[PyTorch] PyTorch Intro & Tensor
⚠️ These notes are based on my study of the official PyTorch Korea tutorials, so please bear with any rough edges!
 

Reference: "Tensor" — PyTorch beginner tutorial (tutorials.pytorch.kr)


PyTorch (ํŒŒ์ดํ† ์น˜)

PyTorch is an open-source deep learning framework developed by Facebook's AI Research (FAIR) team.
It supports fast, flexible prototyping and is widely used in a variety of research and industrial applications.

PyTorch์˜ ์ฃผ์š” ํŠน์ง•

  • ๋™์  ๊ณ„์‚ฐ ๊ทธ๋ž˜ํ”„ (Dynamic Computation Graph): PyTorch๋Š” ๋™์  ๊ณ„์‚ฐ ๊ทธ๋ž˜ํ”„๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ชจ๋ธ์„ ์ •์˜ํ•˜๊ณ  ์—ฐ์‚ฐ์„ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๊ฐ ์—ฐ์‚ฐ์ด ์‹คํ–‰๋  ๋•Œ๋งˆ๋‹ค ์ƒˆ๋กœ์šด ๊ทธ๋ž˜ํ”„๋ฅผ ์ƒ์„ฑํ•˜๋ฉฐ, ์ง๊ด€์ ์ด๊ณ  ์œ ์—ฐํ•œ ๋ชจ๋ธ ์ •์˜ ๋ฐ ๋””๋ฒ„๊น…์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
  • ์ž๋™ ๋ฏธ๋ถ„ (Autograd): PyTorch๋Š” ์ž๋™ ๋ฏธ๋ถ„ ๊ธฐ๋Šฅ์„ ์ œ๊ณตํ•˜์—ฌ, ์—ญ์ „ํŒŒ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์‚ฌ์šฉํ•œ ๊ธฐ์šธ๊ธฐ ๊ณ„์‚ฐ์„ ์ž๋™์œผ๋กœ ์ฒ˜๋ฆฌํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ ํ•™์Šต ์‹œ ๋งค์šฐ ์œ ์šฉํ•˜๋ฉฐ, torch.autograd ๋ชจ๋“ˆ์„ ํ†ตํ•ด ์ง€์›๋ฉ๋‹ˆ๋‹ค.
  • GPU ๊ฐ€์†: PyTorch๋Š” CUDA๋ฅผ ํ†ตํ•œ GPU ๊ฐ€์†์„ ์ง€์›ํ•˜์—ฌ, ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์™€ ๋ณต์žกํ•œ ๋ชจ๋ธ์„ ๋น ๋ฅด๊ฒŒ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ํ…์„œ๋ฅผ GPU๋กœ ์‰ฝ๊ฒŒ ์ด๋™ํ•˜๊ณ , GPU์—์„œ ์—ฐ์‚ฐ์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ์œ ์—ฐํ•œ ์‹ ๊ฒฝ๋ง ๋ชจ๋“ˆ: PyTorch๋Š” ์‹ ๊ฒฝ๋ง ๋ชจ๋“ˆ์„ ์‰ฝ๊ฒŒ ์ •์˜ํ•˜๊ณ  ์กฐํ•ฉํ•  ์ˆ˜ ์žˆ๋Š” ์œ ์—ฐํ•œ ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. torch.nn ๋ชจ๋“ˆ์„ ์‚ฌ์šฉํ•˜๋ฉด ๋‹ค์–‘ํ•œ ๋ ˆ์ด์–ด์™€ ์†์‹ค ํ•จ์ˆ˜ ๋“ฑ์„ ๊ฐ„ํŽธํ•˜๊ฒŒ ๊ตฌํ˜„ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ํ’๋ถ€ํ•œ ์ƒํƒœ๊ณ„: PyTorch๋Š” ๋‹ค์–‘ํ•œ ๋„๊ตฌ์™€ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋กœ ๊ตฌ์„ฑ๋œ ํ’๋ถ€ํ•œ ์ƒํƒœ๊ณ„๋ฅผ ๊ฐ–์ถ”๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, ๋ฐ์ดํ„ฐ ๋กœ๋”ฉ์„ ์œ„ํ•œ torchvision, ๋ถ„์‚ฐ ํ•™์Šต์„ ์œ„ํ•œ torch.distributed, ๊ฐ•ํ™” ํ•™์Šต์„ ์œ„ํ•œ TorchRL ๋“ฑ์ด ์žˆ์Šต๋‹ˆ๋‹ค.
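The autograd feature above can be sketched in a few lines (the function y = x² + 3x here is just an illustration, not from the original notes):

```python
import torch

# A tensor with requires_grad=True is tracked by autograd.
x = torch.tensor(2.0, requires_grad=True)

# Each operation extends the dynamic graph: y = x^2 + 3x
y = x ** 2 + 3 * x

# backward() computes dy/dx = 2x + 3, which is 7 at x = 2
y.backward()
print(x.grad)  # tensor(7.)
```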

Tensor

  • A PyTorch Tensor is the core data structure of the PyTorch library, used for numerical computation.
  • It is similar to NumPy's ndarray, but differs in that it supports fast computation on the GPU.
  • A tensor is a specialized data structure that is very similar to an array or matrix.
  • In PyTorch, tensors are used to encode a model's inputs and outputs, as well as the model's parameters.
  • The main concepts and features of PyTorch Tensors are listed below.

Key Features of PyTorch Tensors

  1. NumPy-like interface: PyTorch Tensors provide an interface similar to NumPy arrays, so they can be handled in a familiar way. Unlike NumPy, however, they can run faster with GPU acceleration.
  2. Support for many dimensions: tensors range from 1-D vectors to high-dimensional arrays, so images, video, audio, and other kinds of data can be handled effectively.
  3. GPU acceleration: PyTorch Tensors support computation on the GPU (CUDA), so large data and complex operations run quickly. A tensor can be moved to the GPU with the tensor.cuda() method.
  4. Autograd: PyTorch's automatic differentiation computes gradients automatically when training deep learning models, which is very useful when implementing backpropagation.
  5. Various initialization methods: tensors can be initialized in many ways, e.g. filled with zeros, filled with ones, or filled with random values.

 

  • Tensors are similar to NumPy's ndarrays, except that tensors can run on GPUs or other hardware accelerators.
  • In fact, tensors and NumPy arrays can often share the same underlying memory, eliminating the need to copy data.
  • Tensors are also optimized for automatic differentiation.
  • If you're familiar with ndarrays, you'll be right at home with the Tensor API.
import torch
import numpy as np

Initializing a Tensor

Tensors can be initialized in several ways. Let's look at the following examples.

๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์ง์ ‘(directly) ์ƒ์„ฑํ•˜๊ธฐ

  • ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์ง์ ‘ ํ…์„œ๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋ฐ์ดํ„ฐ์˜ ์ž๋ฃŒํ˜•(data type)์€ ์ž๋™์œผ๋กœ ์œ ์ถ”ํ•ฉ๋‹ˆ๋‹ค.
# ๋‹ค์ฐจ์› ๋ฆฌ์ŠคํŠธ๋ฅผ ์ด์šฉํ•œ ํ…์„œ ์ƒ์„ฑ
data = [[1, 2],[3, 4]]
x_data = torch.tensor(data)

From a NumPy array

  • Tensors can be created from NumPy arrays.
np_array = np.array(data)
x_np = torch.from_numpy(np_array)

From another tensor

  • Unless explicitly overridden, the new tensor retains the properties (shape, datatype) of the argument tensor.
x_ones = torch.ones_like(x_data) # retains the properties of x_data
print(f"Ones Tensor: \n {x_ones} \n")

x_rand = torch.rand_like(x_data, dtype=torch.float) # overrides the datatype of x_data
print(f"Random Tensor: \n {x_rand} \n")
Ones Tensor: 
 tensor([[1, 1],
        [1, 1]]) 

Random Tensor: 
 tensor([[0.2622, 0.0260],
        [0.2886, 0.1260]])

With random or constant values

  • shape is a tuple of tensor dimensions; in the functions below, it determines the dimensionality of the output tensor.
shape = (2,3,)
rand_tensor = torch.rand(shape)
ones_tensor = torch.ones(shape)
zeros_tensor = torch.zeros(shape)

print(f"Random Tensor: \n {rand_tensor} \n")
print(f"Ones Tensor: \n {ones_tensor} \n")
print(f"Zeros Tensor: \n {zeros_tensor}")
Random Tensor: 
 tensor([[0.8583, 0.1453, 0.4702],
        [0.7411, 0.6996, 0.9439]]) 

Ones Tensor: 
 tensor([[1., 1., 1.],
        [1., 1., 1.]]) 

Zeros Tensor: 
 tensor([[0., 0., 0.],
        [0., 0., 0.]])
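Since torch.rand draws fresh random values on every call, the exact numbers above will differ from run to run; seeding the generator makes them reproducible (a side note added for illustration, not from the original tutorial):

```python
import torch

torch.manual_seed(0)   # fix the RNG state
r1 = torch.rand(2, 3)

torch.manual_seed(0)   # same seed -> same values
r2 = torch.rand(2, 3)

print(torch.equal(r1, r2))  # True
```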

Others

  • Initializing with specific values
# A tensor of all zeros
zero_tensor = torch.zeros((3, 3))

# A tensor of all ones
one_tensor = torch.ones((2, 2))

# A tensor filled with a given value
full_tensor = torch.full((2, 3), 7)

print(zero_tensor)
print(one_tensor)
print(full_tensor)
tensor([[0., 0., 0.],
        [0., 0., 0.],
        [0., 0., 0.]])
tensor([[1., 1.],
        [1., 1.]])
tensor([[7, 7, 7],
        [7, 7, 7]])
  • Creating tensors over a range of values
# A tensor of consecutive, stepped values
arange_tensor = torch.arange(0, 10, step=2)

# A tensor of evenly spaced values
linspace_tensor = torch.linspace(0, 1, steps=5)

print(arange_tensor)
print(linspace_tensor)
tensor([0, 2, 4, 6, 8])
tensor([0.0000, 0.2500, 0.5000, 0.7500, 1.0000])
  • Creating tensors that follow the size of an existing tensor
# A zero-filled tensor with the same size as an existing tensor
existing_tensor = torch.tensor([[1, 2, 3], [4, 5, 6]])
zeros_like_tensor = torch.zeros_like(existing_tensor)

# A one-filled tensor with the same size as an existing tensor
ones_like_tensor = torch.ones_like(existing_tensor)

print(zeros_like_tensor)
print(ones_like_tensor)
tensor([[0, 0, 0],
        [0, 0, 0]])
tensor([[1, 1, 1],
        [1, 1, 1]])

ํ…์„œ์˜ ์†์„ฑ(Attribute)

  • ํ…์„œ์˜ ์†์„ฑ์€ ํ…์„œ์˜ ๋ชจ์–‘(shape), ์ž๋ฃŒํ˜•(datatype) ๋ฐ ์–ด๋Š ์žฅ์น˜์— ์ €์žฅ๋˜๋Š”์ง€๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.

ํ…์„œ์˜ ๋ชจ์–‘ (Shape)

  • ํ…์„œ์˜ ๋ชจ์–‘์€ ํ…์„œ์˜ ๊ฐ ์ฐจ์›์˜ ํฌ๊ธฐ๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.
import torch

# Create a 3x2 tensor
tensor = torch.tensor([[1, 2], [3, 4], [5, 6]])

# ํ…์„œ์˜ ๋ชจ์–‘ ์ถœ๋ ฅ
print(tensor.shape)  # Output: torch.Size([3, 2])

์ž๋ฃŒํ˜• (Datatype)

  • ํ…์„œ์˜ ์ž๋ฃŒํ˜•์€ ํ…์„œ์˜ ์š”์†Œ๋“ค์ด ์–ด๋–ค ๋ฐ์ดํ„ฐ ํƒ€์ž…์ธ์ง€๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.
# ๊ธฐ๋ณธ ์ž๋ฃŒํ˜•์€ float
float_tensor = torch.tensor([1.0, 2.0, 3.0])
print(float_tensor.dtype)  # Output: torch.float32

# Create an integer tensor
int_tensor = torch.tensor([1, 2, 3], dtype=torch.int32)
print(int_tensor.dtype)  # Output: torch.int32

# Create a boolean tensor
bool_tensor = torch.tensor([True, False, True], dtype=torch.bool)
print(bool_tensor.dtype)  # Output: torch.bool
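Besides specifying dtype at creation time, an existing tensor can be cast afterwards, e.g. with .to() (a short sketch added for illustration):

```python
import torch

f = torch.tensor([1.0, 2.0, 3.0])  # torch.float32 by default
i = f.to(torch.int64)              # cast to 64-bit integers

print(i.dtype)  # torch.int64
print(i)        # tensor([1, 2, 3])
```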

Device

  • ํ…์„œ์˜ ์žฅ์น˜๋Š” ํ…์„œ๊ฐ€ ์ €์žฅ๋˜๊ณ  ์—ฐ์‚ฐ๋˜๋Š” ํ•˜๋“œ์›จ์–ด ์žฅ์น˜(CPU ๋˜๋Š” GPU)๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.
# CPU์— ํ…์„œ ์ƒ์„ฑ
cpu_tensor = torch.tensor([1, 2, 3])
print(cpu_tensor.device)  # Output: cpu

# If a GPU is available
if torch.cuda.is_available():
    # Create a tensor on the GPU
    gpu_tensor = torch.tensor([1, 2, 3], device='cuda')
    print(gpu_tensor.device)  # Output: cuda:0
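A common device-agnostic idiom (a sketch; the device variable is my own naming, not from the notes above) is to pick the device once and pass it everywhere:

```python
import torch

# Pick the GPU when available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

t = torch.tensor([1, 2, 3], device=device)
print(t.device)  # cpu on a CPU-only machine, cuda:0 with a GPU
```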

Full Example

# Create a tensor
tensor = torch.tensor([[1.0, 2.0], [3.0, 4.0]], dtype=torch.float64, device='cpu')

# Print the tensor's attributes
print("Shape:", tensor.shape)        # Output: Shape: torch.Size([2, 2])
print("Datatype:", tensor.dtype)     # Output: Datatype: torch.float64
print("Device:", tensor.device)      # Output: Device: cpu

# If a GPU is available
if torch.cuda.is_available():
    # Move the tensor to the GPU
    tensor = tensor.to('cuda')
    print("Device after moving to GPU:", tensor.device)  # Output: Device after moving to GPU: cuda:0
Shape: torch.Size([2, 2])
Datatype: torch.float64
Device: cpu

Tensor Operations

  • Over 100 tensor operations, including transposing, indexing, slicing, mathematical operations, linear algebra, random sampling, and more, are described in the torch API reference: torch — PyTorch 2.4 documentation (pytorch.org).

  • Each of these operations can be run on the GPU (typically faster than on the CPU).
  • If you're using Colab, you can allocate a GPU under Edit > Notebook Settings.
  • By default, tensors are created on the CPU.
  • Tensors can be explicitly moved to the GPU with the .to method (after checking for GPU availability).
  • Keep in mind that copying large tensors across devices can be expensive in terms of time and memory.
# Move the tensor to the GPU if one exists
if torch.cuda.is_available():
    tensor = tensor.to("cuda")

Standard NumPy-like indexing and slicing

tensor = torch.ones(4, 4)
print(f"First row: {tensor[0]}")
print(f"First column: {tensor[:, 0]}")
print(f"Last column: {tensor[..., -1]}")
tensor[:,1] = 0
print(tensor)
First row: tensor([1., 1., 1., 1.])
First column: tensor([1., 1., 1., 1.])
Last column: tensor([1., 1., 1., 1.])
tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])
  • Joining tensors: torch.cat concatenates a sequence of tensors along a given dimension.
t1 = torch.cat([tensor, tensor, tensor], dim=1)
print(t1)
tensor([[1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.],
        [1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.],
        [1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.],
        [1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.]])
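A closely related joining op is torch.stack, which adds a new dimension instead of concatenating along an existing one (a minimal sketch with fresh tensors, added for comparison):

```python
import torch

a = torch.ones(2, 3)
b = torch.zeros(2, 3)

# cat keeps the number of dimensions: (2, 3) + (2, 3) -> (4, 3) along dim=0
print(torch.cat([a, b], dim=0).shape)    # torch.Size([4, 3])

# stack inserts a new dimension: two (2, 3) tensors -> (2, 2, 3)
print(torch.stack([a, b], dim=0).shape)  # torch.Size([2, 2, 3])
```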

Arithmetic operations

# Computes the matrix multiplication between two tensors. y1, y2, y3 will all have the same value.
# ``tensor.T`` returns the transpose of the tensor.
y1 = tensor @ tensor.T
y2 = tensor.matmul(tensor.T)

y3 = torch.rand_like(y1)
torch.matmul(tensor, tensor.T, out=y3)


# Computes the element-wise product. z1, z2, z3 will all have the same value.
z1 = tensor * tensor
z2 = tensor.mul(tensor)

z3 = torch.rand_like(tensor)
torch.mul(tensor, tensor, out=z3)
tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])
  • ๋‹จ์ผ-์š”์†Œ(single-element) ํ…์„œ: ํ…์„œ์˜ ๋ชจ๋“  ๊ฐ’์„ ํ•˜๋‚˜๋กœ ์ง‘๊ณ„(aggregate)ํ•˜์—ฌ ์š”์†Œ๊ฐ€ ํ•˜๋‚˜์ธ ํ…์„œ์˜ ๊ฒฝ์šฐ, item() ์„ ์‚ฌ์šฉํ•˜์—ฌ Python ์ˆซ์ž ๊ฐ’์œผ๋กœ ๋ณ€ํ™˜ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:
agg = tensor.sum()
agg_item = agg.item()
print(agg_item, type(agg_item))

# 12.0 <class 'float'>
  • In-place operations: operations that store the result into the operand are called in-place; they are denoted by a _ suffix. For example: x.copy_(y) and x.t_() will change x.
print(f"{tensor} \n")
tensor.add_(5)
print(tensor)
tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]]) 

tensor([[6., 5., 6., 6.],
        [6., 5., 6., 6.],
        [6., 5., 6., 6.],
        [6., 5., 6., 6.]])

NumPy Bridge

Tensors on the CPU and NumPy arrays share their underlying memory, so changing one will change the other.

Tensor to NumPy array

t = torch.ones(5)
print(f"t: {t}")
n = t.numpy()
print(f"n: {n}")
t: tensor([1., 1., 1., 1., 1.])
n: [1. 1. 1. 1. 1.]
  • ํ…์„œ์˜ ๋ณ€๊ฒฝ ์‚ฌํ•ญ์ด NumPy ๋ฐฐ์—ด์— ๋ฐ˜์˜๋ฉ๋‹ˆ๋‹ค.
t.add_(1)
print(f"t: {t}")
print(f"n: {n}")
t: tensor([2., 2., 2., 2., 2.])
n: [2. 2. 2. 2. 2.]

NumPy array to Tensor

n = np.ones(5)
t = torch.from_numpy(n)
  • Changes in the NumPy array are reflected in the tensor.
np.add(n, 1, out=n)
print(f"t: {t}")
print(f"n: {n}")
t: tensor([2., 2., 2., 2., 2.], dtype=torch.float64)
n: [2. 2. 2. 2. 2.]
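One caveat worth noting: torch.from_numpy shares memory as shown above, whereas torch.tensor(n) makes an independent copy (a sketch added for illustration):

```python
import numpy as np
import torch

n = np.ones(3)
shared = torch.from_numpy(n)  # shares memory with n
copied = torch.tensor(n)      # makes an independent copy

n += 1  # in-place NumPy update
print(shared)  # tensor([2., 2., 2.], dtype=torch.float64)
print(copied)  # tensor([1., 1., 1.], dtype=torch.float64)
```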