Two Visual Intelligence papers accepted for prestigious AI conference

Visual Intelligence researchers have developed new information-theoretic methods and divergence measures, presented in two papers accepted for the prestigious International Conference on Learning Representations (ICLR) 2024. ICLR has an acceptance rate of approximately 30 percent.

By: Robert Jenssen, Director, Visual Intelligence

Developing new theories to reveal information in deep learning

Modern society is data-driven in the sense that sensors and observations provide measurements, and images are one example of such measurements. Key to Visual Intelligence is to automatically reveal and exploit important information in images with deep neural networks in order to help decision makers, for instance by revealing information about possible tumors through the analysis of medical images. To do that, it is essential to be able to define and quantify information in a mathematical sense, and to be able to exploit it.

For instance, it is very often crucial to quantify how much information one population of measurements (P) carries about another population of measurements (Q). This is illustrated in a simplified manner in the figure. A measure of the difference between P and Q is often called a divergence.

P and Q represent different populations of observations. A divergence measure quantifies the difference between P and Q. Illustration by Shujian Yu.
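To make the idea of a divergence concrete, the sketch below computes the Kullback-Leibler divergence, one widely used divergence measure, between two small populations described as histograms. The histogram values and the function name are hypothetical and chosen only for illustration.

```python
import numpy as np

def kl_divergence(p, q):
    """Kullback-Leibler divergence D(P || Q) between two discrete distributions.

    Assumes q is strictly positive wherever p is positive.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()  # normalize so that each histogram sums to one
    q = q / q.sum()
    mask = p > 0     # bins where p is zero contribute nothing to the sum
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

# Hypothetical histograms over three measurement bins
P = [0.5, 0.3, 0.2]
Q = [0.2, 0.3, 0.5]

d_same = kl_divergence(P, P)  # a population compared with itself
d_diff = kl_divergence(P, Q)  # two populations that differ
```

The divergence is zero when the two populations are identical and grows as they become more different.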

Introduces a new measure of divergence

The first paper is entitled Cauchy-Schwarz Divergence Information Bottleneck for Regression. The paper’s authors are Shujian Yu, Sigurd Løkse, Robert Jenssen, and Jose Principe.

Professor Jose Principe (University of Florida) and Shujian Yu (UiT/Free University of Amsterdam) discuss information theory with the idyllic scenery of Tromsø in the background. Photo by Robert Jenssen.

In this paper, the aim is to capture as much information as possible about input images while, at the same time, compressing the data representation through a so-called bottleneck. This is highly related to compression, which is crucial to any digital system. The paper develops a new and better way to do this by introducing a new divergence measure.
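As a rough illustration of the kind of divergence involved, the sketch below estimates the Cauchy-Schwarz divergence between two sets of samples using Gaussian kernels. It covers only the basic divergence between two populations, not the paper's full information bottleneck method. The function names, the kernel width `sigma`, and the synthetic data are assumptions made for this example.

```python
import numpy as np

def gaussian_gram(a, b, sigma):
    """Gaussian kernel values between all rows of a and all rows of b."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    # Clip tiny negative values caused by floating point rounding
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * sigma**2))

def cs_divergence(x, y, sigma=1.0):
    """Kernel-based estimate of the Cauchy-Schwarz divergence between samples."""
    kxx = gaussian_gram(x, x, sigma).mean()
    kyy = gaussian_gram(y, y, sigma).mean()
    kxy = gaussian_gram(x, y, sigma).mean()
    return float(np.log(kxx) + np.log(kyy) - 2.0 * np.log(kxy))

# Synthetic two-dimensional populations (illustrative only)
rng = np.random.default_rng(0)
p_samples = rng.normal(0.0, 1.0, size=(200, 2))
q_near = rng.normal(0.2, 1.0, size=(200, 2))  # close to P
q_far = rng.normal(3.0, 1.0, size=(200, 2))   # far from P

d_self = cs_divergence(p_samples, p_samples)
d_near = cs_divergence(p_samples, q_near)
d_far = cs_divergence(p_samples, q_far)
```

The estimate needs no assumption about the shape of the distributions. It is computed directly from the samples, is zero when a population is compared with itself, and increases as the populations move apart.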

You may read the paper abstract at the lower portion of this article.

New ways of presenting a population

The second paper is titled MAP IT to Visualize Representations. The paper's author is Robert Jenssen.

In this paper, a ubiquitous challenge in machine learning is tackled. When dealing with data such as images, each observation (image) is often composed of millions of numbers (pixel values). This creates big problems since machine learning systems in general work better when observations are characterized by fewer numbers. It is also very challenging to visualize (“look at”) observations composed of millions of numbers. MAP IT proposes a new way to represent a population such that each observation in the population is composed of only two numbers. This enables plotting of the data set for visualization purposes and helps machine learning systems work better.

A simplified example is shown below. Small images of handwritten digits are in this case 24 by 24 pixels, which means that each image is composed of 576 numbers. MAP IT minimizes the divergence between the set of images and a representation of these images composed of only two numbers. The two numbers represent a dot in a plot. When printing the actual images on top of the dots representing the images, it is clear that the main structure is captured, in the sense that dots corresponding to 4s are separated from dots corresponding to 9s and furthermore separated from dots corresponding to 7s.
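The step from 576 numbers per image to two numbers per image can be sketched as follows. Note that this sketch uses principal component analysis (PCA), a classical technique, purely as a stand-in. It is not MAP IT, and the random "images" are hypothetical placeholder data.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in data: 100 random "images" of 24 by 24 pixels
images = rng.random((100, 24, 24))

# Each image becomes one row of 24 * 24 = 576 numbers
flat = images.reshape(len(images), -1)

# PCA (not MAP IT): centre the data and keep the two leading directions
centred = flat - flat.mean(axis=0)
_, _, vt = np.linalg.svd(centred, full_matrices=False)

# Two numbers per image, i.e. one dot in a plot
embedding = centred @ vt[:2].T
```

Each row of `embedding` can be plotted as a dot. MAP IT differs in how the two numbers are chosen: as described above, it minimizes a divergence between the original images and their two-number representation.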

The paper abstract can be viewed at the lower portion of this article.

MAP IT is a new way to visualize representations. Illustration provided by Robert Jenssen.

Information about each paper

Cauchy-Schwarz Divergence Information Bottleneck for Regression

By authors Shujian Yu, Sigurd Løkse, Robert Jenssen, Jose Principe.

Open Review link: https://openreview.net/pdf?id=7wY67ZDQTE

Abstract

The information bottleneck (IB) approach is popular to improve the generalization, robustness and explainability of deep neural networks. Essentially, it aims to find a minimum sufficient representation by striking a trade-off between a compression term, which is usually characterized by mutual information I(x; t) where x refers to the input, and a prediction term usually characterized by I(y; t) where y is the desired response. Mutual information is for the IB for the most part expressed in terms of the Kullback-Leibler (KL) divergence, which in the regression case corresponds to prediction based on mean squared error (MSE) loss with Gaussian assumption and compression approximated by variational inference. In this paper, we study the IB principle for the regression problem and develop a new way to parameterize the IB with deep neural networks by exploiting favorable properties of the Cauchy-Schwarz (CS) divergence. By doing so, we move away from MSE-based regression and ease estimation by avoiding variational approximations or distributional assumptions. We investigate the improved generalization ability of our proposed CS-IB and demonstrate strong adversarial robustness guarantees. We demonstrate its superior performance on six real-world regression tasks over other popular deep IB approaches. Additionally, we observe that the solutions discovered by CS-IB always achieve the best trade-off between prediction accuracy and compression ratio in the information plane.
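For readers who want the quantities in the abstract written out, the trade-off and the two divergences can be expressed in standard textbook form as below. The trade-off weight \beta and the exact sign convention are assumptions here, and the paper's own formulation may differ in detail.

```latex
% Information bottleneck: compress the input x into t while keeping t predictive of y
\min_{p(t \mid x)} \; I(x; t) - \beta \, I(y; t), \qquad \beta > 0

% Kullback-Leibler divergence between densities p and q
D_{\mathrm{KL}}(p \,\|\, q) = \int p(x) \log \frac{p(x)}{q(x)} \, dx

% Cauchy-Schwarz divergence between densities p and q
D_{\mathrm{CS}}(p; q) = -\log \frac{\left( \int p(x) \, q(x) \, dx \right)^{2}}
                                   {\int p(x)^{2} \, dx \, \int q(x)^{2} \, dx}
```

By the Cauchy-Schwarz inequality, the ratio inside the logarithm is at most one, so the CS divergence is never negative and equals zero when p and q coincide.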

MAP IT to Visualize Representations

By author Robert Jenssen.

Open Review link: https://openreview.net/pdf?id=OKf6JtXtoy

Abstract

MAP IT visualizes representations by taking a fundamentally different approach to dimensionality reduction. MAP IT aligns distributions over discrete marginal probabilities in the input space versus the target space, thus capturing information in wider local regions, as opposed to current methods which align based on pairwise probabilities between states only. The MAP IT theory reveals that alignment based on a projective divergence avoids normalization of weights (to obtain true probabilities) entirely, and further reveals a dual viewpoint via continuous densities and kernel smoothing. MAP IT is shown to produce visualizations which capture class structure better than the current state of the art.
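The remark about avoiding normalization can be illustrated with a small sketch. A projective divergence gives the same value when either of its inputs is multiplied by a positive constant. The example below takes the Cauchy-Schwarz divergence as one example of a projective divergence. Whether this matches the paper's exact formulation is an assumption, and the weight vectors are hypothetical.

```python
import numpy as np

def cs_divergence_discrete(p, q):
    """Cauchy-Schwarz divergence between two non-negative weight vectors."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(-np.log(np.dot(p, q) ** 2 / (np.dot(p, p) * np.dot(q, q))))

# Hypothetical unnormalized weights (they do not sum to one)
weights_p = np.array([4.0, 1.0, 1.0])
weights_q = np.array([1.0, 2.0, 5.0])

d_raw = cs_divergence_discrete(weights_p, weights_q)
d_normalized = cs_divergence_discrete(
    weights_p / weights_p.sum(), weights_q / weights_q.sum()
)
d_rescaled = cs_divergence_discrete(10.0 * weights_p, 0.5 * weights_q)
```

All three values agree up to floating point rounding, so the weights never need to be turned into true probabilities before they are compared.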
