torchvision 0.8.0
New datasets
- Added
lfw_people_dataset() and
lfw_pairs_dataset() for loading Labelled Faces in the Wild
(LFW) datasets (@DerrickUnleashed, #203).
- Added
places365_dataset()for loading the Places365
dataset (@koshtiakanksha, #196).
- Added
pascal_segmentation_dataset(), and
pascal_detection_dataset() for loading the Pascal Visual
Object Classes datasets (@DerrickUnleashed, #209).
- Added
whoi_plankton_dataset(),
whoi_small_plankton_dataset(), and
whoi_small_coral_dataset() (@cregouby, #236).
- Added
rf100_document_collection(),
rf100_medical_collection(),
rf100_biology_collection(),
rf100_damage_collection(),
rf100_infrared_collection(), and
rf100_underwater_collection() . Those are collection of
datasets from RoboFlow 100 under the same thematic, for a total of 35
datasets (@koshtiakanksha, @cregouby, #239).
- Added
rf100_peixos_segmentation_dataset(). (@koshtiakanksha,
@cregouby,
#250).
New models
- Added
model_maxvit() for MaxViT: Multi-Axis Vision
Transformer (#229, @koshtiakanksha).
- Added
model_facenet_pnet(),
model_facenet_rnet(), and model_facenet_onet()
for Facenet MTCNN face detection models. (@DerrickUnleashed, #227)
- Added
model_mtcnn() and
model_inception_resnet_v1() models for face detection and
recognition. (@DerrickUnleashed, #217)
- Added
model_mobilenet_v3_large() and
model_mobilenet_v3_small() models for efficient image
classification. (@DerrickUnleashed, #237)
- Added 8 of the
model_convnext_() family models for
image classification, thanks to @horlar1 contribution. (@cregouby, #251)
- Added 2
model_fasterrcnn_resnet50_() models and 2
model_fasterrcnn_mobilenet_v3_large_() for object
detection. (@koshtiakanksha, #251)
New features
- Added
imagenet_label() and
imagenet_classes() for ImageNet classes resolution (#229,
@koshtiakanksha).
base_loader() now accept URLs (@cregouby, #246).
draw_segmentation_masks() now accepts semantic
segmentation models torch_float() output. (@cregouby #247)
- MNIST datasets and Roboflow 100 collections now have a
.getbatch attached method (@cregouby #255)
Bug fixes and improvements
- Switch pre 0.5.0 models to their
/v2/ URL in
torch-cdn.mlverse.org. (#215)
- Models are now separated in the documentation by tasks between
classification, object detection, and semantic segmentation models
(@cregouby,
#247)
- Breaking Change : Refactoring of
coco_* dataset family
now provides each item$x being an image array (for
consistency with other datasets). You can use
transform = transform_to_tensor to restore the previous x
output to be a torch_tensor().
transform_ are now documented into 3 different
categories: unitary transformations, random transformations and
combining transformations. (@cregouby, #250)
- Deprecation :
emnist_dataset is deprecated in favor of
emnist_collection() (@cregouby, #260).
torchvision 0.7.0
New datasets
- Added
fashion_mnist_dataset() for loading the
Fashion-MNIST dataset (@koshtiakanksha, #148).
- Added
eurosat_dataset(),
eurosat_all_bands_dataset(), and
eurosat100_dataset() for loading RGB, all-band, and
small-subset variants of the EuroSAT dataset (@cregouby, #126).
- Added
qmnist_dataset() for loading the QMNIST dataset
(@DerrickUnleashed, #153).
- Added
emnist_dataset() for loading the EMNIST dataset
(@DerrickUnleashed, #152).
- Added
fgvc_aircraft_dataset() for loading the
FGVC-Aircraft dataset (@DerrickUnleashed, #156).
- Added
coco_detection_dataset() and
coco_caption_dataset() for loading the MS COCO detection
and captions datasets (@koshtiakanksha, #161, #172).
- Added
caltech101_dataset() and
caltech256_dataset() for loading the Caltech 101 and 256
datasets (@DerrickUnleashed, #158).
- Added
fer_dataset() for loading the FER-2013 dataset
(@DerrickUnleashed, #154).
- Added
flowers102_dataset() for loading the Flowers102
dataset (@DerrickUnleashed, #157).
- Added
flickr8k_dataset() and
flickr30k_dataset() for loading the Flickr8k and Flickr30k
datasets (@DerrickUnleashed, #159).
- Added
oxfordiiitpet_dataset(),
oxfordiiitpet_binary_dataset(), and
oxfordiiitpet_segmentation_dataset() for loading the
Oxford-IIIT Pet datasets (@DerrickUnleashed, #162).
- Added
rf100_document_collection(),
rf100_underwater_collection(),
rf100_medical_collection(),
rf100_biology_collection(), and
rf100_peixos_segmentation_dataset() for loading Roboflow
100 datasets (@koshtiakanksha, #239).
New models
- Added EfficientNet model family (B0–B7) – scalable CNNs for image
classification. (#166, @koshtiakanksha)
- Added EfficientNetV2 model family (V2-S/M/L) – improved EfficientNet
models for faster training. (#166, @koshtiakanksha)
- Added
model_vit_b_16(), model_vit_b_32(),
model_vit_l_16(), model_vit_l_32(), and
model_vit_h_14() for loading Vision Transformer models
(@DerrickUnleashed, #202).
New features
tensor_image_display() and
tensor_image_browse() now accept all
tensor_image dtypes (@cregouby, #115).
draw_bounding_boxes() and
draw_segmentation_masks() now accept
image_with_bounding_box and
image_with_segmentation_mask inputs which are the default
items class for respectively detection datasets and segmentation
datasets (@koshtiakanksha, #175).
fgvc_aircraft_dataset() gains support for
annotation_level = "all" (@DerrickUnleashed, #168).
folder_dataset() now supports TIFF image formats (@cregouby, #169).
- New
nms() and batched_nms() functions
provide Non-Maximum Suppression utilities. Added
box_convert() to convert between bounding box formats
(@Athospd, #40).
Minor bug fixes and
improvements
transform_rotation() now correctly uses width × height
for image size instead of width × width (@cregouby, #114).
- Clarified documentation for
transform_affine() to
reduce confusion with transform_random_affine() (@cregouby, #116).
- Added French translations for message outputs (@cregouby, #112).
torchvision 0.6.0
- Remove again dependency on
zip::unzip added in version
0.4.0. (#89)
- Improve performance on
tinyimagenet-alexnet example
(#90, @statist-bhfz)
- Updated URL of downloaded resources to use the new torch CDN.
(#109)
torchvision 0.5.1
- Remove usage of
torch_lstsq that was removed in torch
v0.10.0
torchvision 0.5.0
- Bugs fixed in
transform_adjust_hue() and
transform_linear_transformation() (#72, #73, @sebffischer)
- add
draw_bounding_boxes() ,
draw_segmentation_masks() and draw_keypoints()
on top of image tensors, and add a convenience
tensor_image_browse() and
tensor_image_display() functions to visualize image tensors
respectively in browser or in X11 device (#80, @cregouby)
- Added the InceptionV3 model. (#82)
torchvision 0.4.1
- Implemented MobileNetV2 (#60)
- Improved vignettes so they use
nnf_cross_entropy for
numerical stability. (#61)
- Implement the full list of ResNet model family (#66, @cregouby)
- Improved how datasets and models are downloaded by using a large
timeout by default and downloading to temporary file to avoid hard to
debug errors when the files are corrupt. (#67)
torchvision 0.4.0
- Added a dependency on
zip to zip::unzip
the tinyimagenet dataset.
- Removed all usages of
torch::enumerate() from docs and
tests in favor of coro::loop() (#57)
- Fixed non-namespaced calls to
torch. (#58)
torchvision 0.3.0
- Use a self hosted version of the MNIST dataset to avoid frequent
download failures. (#48)
- Fix
torch_arange calls after breaking change in
torch. (#47)
- Fix bug in
transform_resize when passing
size with length 1. (#49)
torchvision 0.2.0
- Fixed bugs in
transform_rotate. (#31)
- Fixed bugs in
transform_random_affine and
transform_affine (#32)
- Added VGG model (#35)
torchvision 0.1.0
- Added a
NEWS.md file to track changes to the
package.