Detectron2 Getting-Started Example

This post follows the official documentation and runs its example script.

Everything is run inside Jupyter.

# Install detectron2

"""

# install dependencies: (use cu101 because colab has CUDA 10.1)

!pip install -U torch==1.5 torchvision==0.6 -f https://download.pytorch.org/whl/cu101/torch_stable.html

!pip install cython pyyaml==5.1

!pip install -U 'git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI'

import torch, torchvision

print(torch.__version__, torch.cuda.is_available())

!gcc --version

# opencv is pre-installed on colab

# install detectron2:

!pip install detectron2==0.1.2 -f https://dl.fbaipublicfiles.com/detectron2/wheels/cu101/index.html

# You may need to restart your runtime prior to this, to let your installation take effect

# Some basic setup:

# Setup detectron2 logger

import detectron2

from detectron2.utils.logger import setup_logger
setup_logger()

# import some common libraries
import numpy as np
import cv2
import random
from google.colab.patches import cv2_imshow

# import some common detectron2 utilities
from detectron2 import model_zoo
from detectron2.engine import DefaultPredictor
from detectron2.config import get_cfg
from detectron2.utils.visualizer import Visualizer
from detectron2.data import MetadataCatalog

"""

# Run a pre-trained detectron2 model

We first download a random image from the COCO dataset:

"""

!wget http://images.cocodataset.org/val2017/000000439715.jpg -O input.jpg

im = cv2.imread("./input.jpg")

cv2_imshow(im)

"""Then, we create a detectron2 config and a detectron2 `DefaultPredictor` to run inference on this image."""

cfg = get_cfg()

# add project-specific config (e.g., TensorMask) here if you're not running a model in detectron2's core library

cfg.merge_from_file(model_zoo.get_config_file("COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml"))

cfg.MODEL.ROI_HEADS.SCORE_THRESH_TEST = 0.5  # set threshold for this model

# Find a model from detectron2's model zoo. You can use the https://dl.fbaipublicfiles... url as well

cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url("COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml")

predictor = DefaultPredictor(cfg)

outputs = predictor(im)

# look at the outputs. See https://detectron2.readthedocs.io/tutorials/models.html#model-output-format for specification

outputs["instances"].pred_classes

outputs["instances"].pred_boxes

# We can use `Visualizer` to draw the predictions on the image.

v = Visualizer(im[:, :, ::-1], MetadataCatalog.get(cfg.DATASETS.TRAIN[0]), scale=1.2)

v = v.draw_instance_predictions(outputs["instances"].to("cpu"))

cv2_imshow(v.get_image()[:, :, ::-1])
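The effect of `SCORE_THRESH_TEST` can be pictured with a small plain-Python sketch. It does not call detectron2; the `raw_detections` list and the `filter_by_score` helper are hypothetical names with made-up values, and the comparison is assumed to be a strict "greater than". The point is only that detections scoring at or below the threshold never reach `outputs["instances"]`.

```python
# Hypothetical raw detections as (label, score) pairs; the values are made up.
raw_detections = [
    ("person", 0.98),
    ("horse", 0.91),
    ("umbrella", 0.42),
    ("person", 0.07),
]


def filter_by_score(detections, score_thresh):
    """Keep only detections whose confidence is strictly above score_thresh."""
    return [(label, score) for label, score in detections if score > score_thresh]


kept = filter_by_score(raw_detections, 0.5)
print(kept)  # [('person', 0.98), ('horse', 0.91)]
```

Raising the threshold trades recall for precision: fewer boxes are drawn, but the ones that remain are more likely to be correct.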

"""

# Train on a custom dataset

In this section, we show how to train an existing detectron2 model on a custom dataset in a new format.

We use [the balloon segmentation dataset](https://github.com/matterport/Mask_RCNN/tree/master/samples/balloon)

which only has one class: balloon.

We'll train a balloon segmentation model from an existing model pre-trained on the COCO dataset, available in detectron2's model zoo.

Note that the COCO dataset does not have the "balloon" category. We'll be able to recognize this new class in a few minutes.

## Prepare the dataset

"""

# download, decompress the data

!wget https://github.com/matterport/Mask_RCNN/releases/download/v2.1/balloon_dataset.zip

!unzip balloon_dataset.zip > /dev/null

"""Register the balloon dataset to detectron2, following the [detectron2 custom dataset tutorial](https://detectron2.readthedocs.io/tutorials/datasets.html).

Here, the dataset is in its custom format, therefore we write a function to parse it and prepare it into detectron2's standard format. See the tutorial for more details.

"""

# if your dataset is in COCO format, this cell can be replaced by the following three lines:

# from detectron2.data.datasets import register_coco_instances

# register_coco_instances("my_dataset_train", {}, "json_annotation_train.json", "path/to/image/dir")

# register_coco_instances("my_dataset_val", {}, "json_annotation_val.json", "path/to/image/dir")

import os
import numpy as np
import json
from detectron2.structures import BoxMode

def get_balloon_dicts(img_dir):
    json_file = os.path.join(img_dir, "via_region_data.json")
    with open(json_file) as f:
        imgs_anns = json.load(f)

    dataset_dicts = []
    for idx, v in enumerate(imgs_anns.values()):
        record = {}

        # per-image fields required by detectron2's standard dataset format
        filename = os.path.join(img_dir, v["filename"])
        height, width = cv2.imread(filename).shape[:2]

        record["file_name"] = filename
        record["image_id"] = idx
        record["height"] = height
        record["width"] = width

        annos = v["regions"]
        objs = []
        for _, anno in annos.items():
            assert not anno["region_attributes"]
            anno = anno["shape_attributes"]
            px = anno["all_points_x"]
            py = anno["all_points_y"]
            # build (x, y) vertex pairs, then flatten them to [x0, y0, x1, y1, ...]
            poly = [(x + 0.5, y + 0.5) for x, y in zip(px, py)]
            poly = [p for x in poly for p in x]

            obj = {
                "bbox": [np.min(px), np.min(py), np.max(px), np.max(py)],
                "bbox_mode": BoxMode.XYXY_ABS,
                "segmentation": [poly],
                "category_id": 0,
                "iscrowd": 0
            }
            objs.append(obj)
        record["annotations"] = objs
        dataset_dicts.append(record)
    return dataset_dicts
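To make the per-region conversion in `get_balloon_dicts` concrete, here is a minimal sketch on a single made-up polygon. It uses only built-ins (`min`/`max` instead of NumPy), and the coordinates are invented for illustration rather than taken from the balloon dataset.

```python
# Made-up polygon in the annotation file's layout: separate x and y lists.
px = [10, 50, 40, 12]
py = [20, 22, 60, 58]

# Shift each vertex by 0.5, mirroring the function above.
points = [(x + 0.5, y + 0.5) for x, y in zip(px, py)]

# Flatten [(x0, y0), (x1, y1), ...] into [x0, y0, x1, y1, ...].
poly = [coord for point in points for coord in point]

# Axis-aligned box in XYXY order: x_min, y_min, x_max, y_max.
bbox = [min(px), min(py), max(px), max(py)]

print(poly)  # [10.5, 20.5, 50.5, 22.5, 40.5, 60.5, 12.5, 58.5]
print(bbox)  # [10, 20, 50, 60]
```

The flat list is what ends up inside `"segmentation": [poly]`, and the four-number box is what `BoxMode.XYXY_ABS` describes.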

from detectron2.data import DatasetCatalog, MetadataCatalog

for d in ["train", "val"]:
    DatasetCatalog.register("balloon_" + d, lambda d=d: get_balloon_dicts("balloon/" + d))
    MetadataCatalog.get("balloon_" + d).set(thing_classes=["balloon"])
balloon_metadata = MetadataCatalog.get("balloon_train")
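The `lambda d=d:` in the registration loop is easy to overlook. The registered function is only called later, when the dataset is actually requested, so a plain `lambda:` would look up `d` at that point and see its final value. The sketch below shows the difference in plain Python; `load` and `build_registries` are hypothetical stand-ins, not detectron2 APIs.

```python
def load(path):
    """Stand-in for get_balloon_dicts; just reports which path was requested."""
    return "loaded " + path


def build_registries(splits):
    late, bound = {}, {}
    for d in splits:
        # Looks up d when the lambda is *called*, i.e. after the loop has ended.
        late["balloon_" + d] = lambda: load("balloon/" + d)
        # Default argument captures the value of d at *definition* time.
        bound["balloon_" + d] = lambda d=d: load("balloon/" + d)
    return late, bound


late, bound = build_registries(["train", "val"])
print(late["balloon_train"]())   # loaded balloon/val   (wrong split)
print(bound["balloon_train"]())  # loaded balloon/train (intended split)
```

Without the default argument, both registered names would load the validation split.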

"""To verify the data loading is correct, let's visualize the annotations of randomly selected samples in the training set:"""

dataset_dicts = get_balloon_dicts("balloon/train")

for d in random.sample(dataset_dicts, 3):
    img = cv2.imread(d["file_name"])
    visualizer = Visualizer(img[:, :, ::-1], metadata=balloon_metadata, scale=0.5)
    vis = visualizer.draw_dataset_dict(d)
    cv2_imshow(vis.get_image()[:, :, ::-1])

"""

## Train!

Now, let's fine-tune a COCO-pretrained R50-FPN Mask R-CNN model on the balloon dataset. It takes ~6 minutes to train 300 iterations on Colab's K80 GPU, or ~2 minutes on a P100 GPU.

"""

from detectron2.engine import DefaultTrainer
from detectron2.config import get_cfg

cfg = get_cfg()
cfg.merge_from_file(model_zoo.get_config_file("COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml"))
cfg.DATASETS.TRAIN = ("balloon_train",)
cfg.DATASETS.TEST = ()
cfg.DATALOADER.NUM_WORKERS = 2
cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url("COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml")  # Let training initialize from model zoo
cfg.SOLVER.IMS_PER_BATCH = 2
cfg.SOLVER.BASE_LR = 0.00025  # pick a good LR
cfg.SOLVER.MAX_ITER = 300  # 300 iterations seems good enough for this toy dataset; you may need to train longer for a practical dataset
cfg.MODEL.ROI_HEADS.BATCH_SIZE_PER_IMAGE = 128  # faster, and good enough for this toy dataset (default: 512)
cfg.MODEL.ROI_HEADS.NUM_CLASSES = 1  # only has one class (balloon)

os.makedirs(cfg.OUTPUT_DIR, exist_ok=True)
trainer = DefaultTrainer(cfg)
trainer.resume_or_load(resume=False)
trainer.train()

# Commented out IPython magic to ensure Python compatibility.

# Look at training curves in tensorboard:

# %load_ext tensorboard

# %tensorboard --logdir output
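As a rough sanity check on the solver settings, the number of passes over the training set can be estimated from `IMS_PER_BATCH` and `MAX_ITER`. The sketch below is plain arithmetic; the figure of 61 training images is an assumption about the balloon dataset, so check it against `len(get_balloon_dicts("balloon/train"))` in your own run.

```python
ims_per_batch = 2       # cfg.SOLVER.IMS_PER_BATCH in the config above
max_iter = 300          # cfg.SOLVER.MAX_ITER in the config above
num_train_images = 61   # ASSUMPTION: size of balloon/train; verify locally

images_seen = ims_per_batch * max_iter
epochs = images_seen / num_train_images

print(images_seen)       # 600
print(round(epochs, 1))  # 9.8
```

Under that assumption, 300 iterations correspond to roughly ten passes over the training images, which is consistent with the comment that a practical dataset may need longer training.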

"""

## Inference & evaluation using the trained model

Now, let's run inference with the trained model on the balloon validation dataset. First, let's create a predictor using the model we just trained:

"""

cfg.MODEL.WEIGHTS = os.path.join(cfg.OUTPUT_DIR, "model_final.pth")
cfg.MODEL.ROI_HEADS.SCORE_THRESH_TEST = 0.7  # set the testing threshold for this model

cfg.DATASETS.TEST = ("balloon_val", )

predictor = DefaultPredictor(cfg)

"""Then, we randomly select several samples to visualize the prediction results."""

from detectron2.utils.visualizer import ColorMode

dataset_dicts = get_balloon_dicts("balloon/val")
for d in random.sample(dataset_dicts, 3):
    im = cv2.imread(d["file_name"])
    outputs = predictor(im)
    v = Visualizer(im[:, :, ::-1],
                   metadata=balloon_metadata,
                   scale=0.8,
                   instance_mode=ColorMode.IMAGE_BW  # remove the colors of unsegmented pixels
    )
    v = v.draw_instance_predictions(outputs["instances"].to("cpu"))
    cv2_imshow(v.get_image()[:, :, ::-1])

"""We can also evaluate its performance using the AP metric implemented in the COCO API.

This gives an AP of ~70%. Not bad!

"""

from detectron2.evaluation import COCOEvaluator, inference_on_dataset
from detectron2.data import build_detection_test_loader

evaluator = COCOEvaluator("balloon_val", cfg, False, output_dir="./output/")
val_loader = build_detection_test_loader(cfg, "balloon_val")
inference_on_dataset(trainer.model, val_loader, evaluator)

# another equivalent way is to use trainer.test
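COCO-style AP is built on intersection-over-union (IoU): a prediction counts as a match only if its overlap with a ground-truth object exceeds a threshold, and the headline AP averages over thresholds from 0.50 to 0.95. The sketch below computes IoU for two boxes in XYXY format as an illustration of that overlap measure. `box_iou` is a hypothetical helper, not the COCO API implementation, and for segmentation the same ratio is computed on masks rather than boxes.

```python
def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x_min, y_min, x_max, y_max)."""
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b

    # Width and height of the overlap rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    inter_h = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = inter_w * inter_h

    union = (ax1 - ax0) * (ay1 - ay0) + (bx1 - bx0) * (by1 - by0) - inter
    return inter / union if union > 0 else 0.0


# Two 10x10 boxes that share half their area: 50 / 150.
print(round(box_iou((0, 0, 10, 10), (5, 0, 15, 10)), 3))  # 0.333
```

With an IoU of about 0.33, this pair would not count as a match even at the loosest COCO threshold of 0.50.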

"""

# Other types of builtin models

"""

# Inference with a keypoint detection model

cfg = get_cfg()

cfg.merge_from_file(model_zoo.get_config_file("COCO-Keypoints/keypoint_rcnn_R_50_FPN_3x.yaml"))

cfg.MODEL.ROI_HEADS.SCORE_THRESH_TEST = 0.7  # set threshold for this model

cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url("COCO-Keypoints/keypoint_rcnn_R_50_FPN_3x.yaml")

predictor = DefaultPredictor(cfg)

outputs = predictor(im)

v = Visualizer(im[:, :, ::-1], MetadataCatalog.get(cfg.DATASETS.TRAIN[0]), scale=1.2)

v = v.draw_instance_predictions(outputs["instances"].to("cpu"))

cv2_imshow(v.get_image()[:, :, ::-1])

# Inference with a panoptic segmentation model

cfg = get_cfg()

cfg.merge_from_file(model_zoo.get_config_file("COCO-PanopticSegmentation/panoptic_fpn_R_101_3x.yaml"))

cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url("COCO-PanopticSegmentation/panoptic_fpn_R_101_3x.yaml")

predictor = DefaultPredictor(cfg)

panoptic_seg, segments_info = predictor(im)["panoptic_seg"]

v = Visualizer(im[:, :, ::-1], MetadataCatalog.get(cfg.DATASETS.TRAIN[0]), scale=1.2)

v = v.draw_panoptic_seg_predictions(panoptic_seg.to("cpu"), segments_info)

cv2_imshow(v.get_image()[:, :, ::-1])

"""

# Run panoptic segmentation on a video

(Author's note: I could not run this section for now because of network issues.)

"""

# This is the video we're going to process

from IPython.display import YouTubeVideo, display

video = YouTubeVideo("ll8TgCZ0plk", width=500)

display(video)

# Install dependencies, download the video, and crop a short clip for processing (the ffmpeg command below keeps the first 6 seconds)

!pip install youtube-dl

!pip uninstall -y opencv-python opencv-contrib-python

!apt install python3-opencv  # the pre-installed one has some issues

!youtube-dl https://www.youtube.com/watch?v=ll8TgCZ0plk -f 22 -o video.mp4

!ffmpeg -i video.mp4 -t 00:00:06 -c:v copy video-clip.mp4

# Run frame-by-frame inference demo on this video (takes 3-4 minutes)

# Using a model trained on COCO dataset

!git clone https://github.com/facebookresearch/detectron2

!python detectron2/demo/demo.py --config-file detectron2/configs/COCO-PanopticSegmentation/panoptic_fpn_R_101_3x.yaml --video-input video-clip.mp4 --confidence-threshold 0.6 --output video-output.mkv \
  --opts MODEL.WEIGHTS detectron2://COCO-PanopticSegmentation/panoptic_fpn_R_101_3x/139514519/model_final_cafdb1.pkl

# Download the results

from google.colab import files

files.download('video-output.mkv')
