Using MMagic Inferencer on Kaggle

2 min readJan 9, 2025

MMagic (Multimodal Advanced, Generative, and Intelligent Creation) is an open-source AIGC toolbox for professional AI researchers and machine learning engineers to explore image and video processing, editing and generation.

MMagic Inferencer provides access to various pre-trained models in computer vision. I faced challenges setting it up on a Kaggle notebook, but after some trial and error, I figured it out. In this article, I’ll walk you through the setup using their video super-resolution models as an example.

Please refer to this Kaggle Notebook for the code:

MMagic Inference on Kaggle

Explore and run machine learning code with Kaggle Notebooks | Using data from Low Resolution Videos

www.kaggle.com

The MMagic library hasn’t been updated recently, so following the official installation instructions can lead to dependency conflicts. The simple fix is to install specific versions of the dependencies to get MMagic working.

First, install the following version of diffusers package:

!pip install diffusers==0.24.0

Then run the following:

!pip install mmcv==2.1.0
!pip install mmengine
!pip install mmagic

With this you should be good to go. You can import the MMagicInferencer using the following code:

from mmagic.apis import MMagicInferencer

model = MMagicInferencer('basicvsr')

You can view a list of supported models here:

mmagic/mmagic/apis/mmagic_inferencer.py at main · open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC)…

github.com

With the setup complete, you can now use MMagic Inferencer to run various computer vision models. If you encounter any issues, checking the specific dependency versions should resolve most conflicts. Happy experimenting with MMagic!

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Computer Vision

Written by Dan Niles

28 Followers

32 Following

Computer Science & Engineering Undergraduate @ University of Moratuwa, Sri Lanka

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

More from Dan Niles

Authentication and Authorization in Apache Solr using the Solr Operator (on Kubernetes)

Dan Niles

Authentication and Authorization in Apache Solr using the Solr Operator (on Kubernetes)

In this tutorial we will see how to setup authentication and authorization in Apache Solr (using the Solr Operator) on Kubernetes. If you…

Jan 30, 2024

Setting up Apache Solr on Kubernetes (with Rancher Desktop)

Dan Niles

Setting up Apache Solr on Kubernetes (with Rancher Desktop)

Solr is a popular, blazing-fast, open-source enterprise search platform built on Apache Lucene™. Solr exhibits exceptional reliability…

Jan 13, 2024

Spellchecking on Apache Solr (in SolrCloud mode)

Dan Niles

Spellchecking on Apache Solr (in SolrCloud mode)

Spellcheck is a useful feature to provide to your users when building your search app. Apache Solr natively supports spellchecking, making…

Mar 3, 2024

Integrating Apache Solr with Ballerina Central to Enhance Search Capabilities

Dan Niles

Integrating Apache Solr with Ballerina Central to Enhance Search Capabilities

During my internship at WSO2, I had the opportunity to work on integrating Apache Solr with Ballerina Central, the official package…

Feb 2

See all from Dan Niles

Recommended from Medium

Abhishek Jain

IoU (Intersection over Union)

Intersection over Union (IoU) is used to evaluate the performance of object detection by comparing the ground truth bounding box to the…

Jan 20

YOLO v3 v5 v8 explanation | YOLO vs. Faster R-CNN

Jo Wang

YOLO v3 v5 v8 explanation | YOLO vs. Faster R-CNN

YOLO (You Only Look Once): YOLO treats object detection as a regression problem, predicting bounding boxes and class probabilities directly…

Oct 20, 2024

Lists

Natural Language Processing

1977 stories1619 saves

Mert

How to use YOLOv11 for Object Detection

Introduction

Oct 5, 2024

Image Segmentation in Machine Learning: A Step-by-Step Guide

Daniel García

Image Segmentation in Machine Learning: A Step-by-Step Guide

If you’ve ever wondered how self-driving cars recognize objects on the road or how medical imaging software detects tumors, the answer…

Sep 23, 2024

Object detection with Vision Transformers

AI Innovator From PrismAI

Abhijat Sarari

Object detection with Vision Transformers

Object detection is a core task in computer vision, powering technologies from self-driving cars to real-time video surveillance. It…

Oct 20, 2024

Exploring EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies: A Brief…

Towards AI

Vincent Liu

Exploring EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies: A Brief…

A Real-Time Anomaly Detection Network surpasses all the existing networks

May 8, 2024

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams