AI 第28天：挑戰專案：多任務 AI 應用

發布於 2024/11/28

作者 Ned Hsu

5 分鐘閱讀

今天的課程將帶領你進行一個挑戰性專案，設計並實作一個多任務 AI 應用。多任務學習（Multi-task Learning, MTL）能讓模型同時處理多個相關任務，達到更高效的性能與更好的泛化能力。這不僅能提升模型的實用性，也讓你熟悉在真實情境中解決多樣化問題的方法。

課程目標

瞭解多任務學習的概念與優勢。
設計並實現一個包含多任務的 AI 專案。
熟悉如何整合模型來解決多樣化問題。

課程內容

1. 多任務學習簡介

1.1 什麼是多任務學習？

多任務學習是一種深度學習方法，目的是讓單一模型同時學習和執行多個相關的任務。它的核心思想是通過共享模型中的部分參數，促進任務之間的知識共享。

範例應用：
1. 自駕車：同時進行車道偵測（分割任務）與障礙物辨識（分類任務）。
2. 文本分析：同時進行情感分析與關鍵字抽取。

1.2 多任務學習的優勢

提升性能：任務之間的互補性可以提升模型的學習效果。
數據效率高：共享數據資源，減少對單一任務大數據的需求。
更好的泛化能力：任務間的正則化效果有助於避免過擬合。

2. 挑戰專案：AI 多任務應用設計

2.1 專案需求

我們將實作一個簡單的多任務應用，結合 NLP 和 CV 技術。以下是專案目標：

任務 1：文本情感分析
- 分析輸入的產品評論，判斷情感為正面或負面。
任務 2：手寫數字辨識
- 識別輸入影像中的手寫數字（0-9）。

2.2 專案規劃

數據集
- 情感分析數據：IMDB 影評數據集。
- 手寫數字數據：MNIST 數據集。
模型架構
- 建立一個共享的神經網路主幹，並為每個任務設計獨立的輸出層：
  - 文本情感分析：Softmax 分類器。
  - 手寫數字辨識：10 類分類器。

3. 專案實作步驟

3.1 安裝與準備

pip install tensorflow numpy pandas scikit-learn matplotlib

3.2 建立模型

以下是多任務模型的實現範例：

  
import tensorflow as tf
from tensorflow.keras import layers, Model

# 輸入層
shared_input = tf.keras.Input(shape=(None, 784), name="shared_input")

# 共享主幹
shared_layer = layers.Dense(128, activation="relu")(shared_input)
shared_layer = layers.Dropout(0.3)(shared_layer)

# 任務 1：情感分析
text_branch = layers.Dense(64, activation="relu")(shared_layer)
text_output = layers.Dense(2, activation="softmax", name="text_output")(text_branch)

# 任務 2：手寫數字辨識
digit_branch = layers.Dense(64, activation="relu")(shared_layer)
digit_output = layers.Dense(10, activation="softmax", name="digit_output")(digit_branch)

# 模型整合
multi_task_model = Model(inputs=shared_input, outputs=[text_output, digit_output])

# 編譯模型
multi_task_model.compile(
    optimizer="adam",
    loss={
        "text_output": "sparse_categorical_crossentropy",
        "digit_output": "sparse_categorical_crossentropy",
    },
    metrics={
        "text_output": "accuracy",
        "digit_output": "accuracy",
    },
)

multi_task_model.summary()

3.3 訓練模型

  
# 輸入數據（需事先處理）
text_data, text_labels = ..., ...
digit_data, digit_labels = ..., ...

# 訓練模型
history = multi_task_model.fit(
    {"shared_input": [text_data, digit_data]},
    {"text_output": text_labels, "digit_output": digit_labels},
    batch_size=32,
    epochs=10,
    validation_split=0.2,
)

3.4 評估與測試

  
# 測試模型
results = multi_task_model.evaluate({"shared_input": test_data}, {"text_output": test_labels, "digit_output": digit_test_labels})
print("測試結果：", results)

4. 課後作業

調整模型結構：嘗試加入更多隱藏層，觀察模型性能的變化。
新任務整合：新增第三個任務，例如影像分割或文本主題分類，擴展模型能力。
模型優化：使用 Grid Search 或 Random Search 對超參數進行調整，提高模型精度。

Software, AI

AI Python

本文章以 CC BY 4.0 授權