On algorithm design for constrained optimization problems in machine learning

Speaker: 

Yue Xie

Institution: 

University of Hong Kong

Time: 

Thursday, June 20, 2024 - 3:00pm to 4:00pm

Location: 

306

In this talk, I will focus on the resolution of two important subclasses of constrained optimization: bound-constrained problems and linear programming. They are motivated by popular machine learning topics, including nonnegative matrix factorization and optimal transport (OT). For the former subclass, I will introduce a two-metric projection method that effectively exploits Hessian information of the objective function. This method inspires several algorithms, including a projected Newton-CG equipped with optimal worst-case complexity guarantees and an adaptive two-metric projection method designed to handle l1-norm regularization. For the linear programming formulation of OT, I will discuss random block coordinate descent (RBCD) methods. A direct advantage of these methods is memory savings; we demonstrate their efficiency by comparison with competitors, including the classical Sinkhorn algorithm.
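
As a point of reference for the comparison mentioned above, here is a minimal sketch of the classical Sinkhorn algorithm for entropically regularized OT, the baseline against which the RBCD methods are compared. The toy cost matrix, uniform marginals, and regularization parameter eps are illustrative assumptions, not the speaker's experimental setup; note that the sketch stores the full kernel matrix, which is exactly the memory cost that block coordinate methods aim to avoid.

```python
import numpy as np

def sinkhorn(a, b, C, eps=0.05, n_iter=500):
    """Entropic OT between histograms a and b with cost matrix C.

    Classical Sinkhorn iterations: alternately rescale the kernel
    K = exp(-C / eps) so the coupling matches both marginals.
    The full n-by-m kernel is kept in memory.
    """
    K = np.exp(-C / eps)
    u = np.ones_like(a)
    for _ in range(n_iter):
        v = b / (K.T @ u)
        u = a / (K @ v)
    P = u[:, None] * K * v[None, :]   # transport plan
    return P, np.sum(P * C)           # plan and transport cost

# Toy example: uniform histograms over two random point clouds.
rng = np.random.default_rng(0)
x, y = rng.normal(size=(50, 2)), rng.normal(loc=1.0, size=(60, 2))
C = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)
a, b = np.full(50, 1 / 50), np.full(60, 1 / 60)
P, cost = sinkhorn(a, b, C)
```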

Enhancing Model Efficiency: Applications of Tensor Train Decomposition in Machine Learning

Speaker: 

Eric Liu

Institution: 

SDSU and UCI

Time: 

Tuesday, May 28, 2024 - 3:00pm to 4:00pm

Location: 

RH 440R

The application of Tensor Train (TT) decomposition in machine learning models provides a promising approach to addressing challenges related to model size and computational complexity. TT decomposition, by breaking down high-dimensional weight tensors into smaller, more manageable tensor cores, allows for significant reductions in model size while maintaining performance. This presentation will explore how TT decomposition can be effectively used in different types of models.

TT decomposition is applied differently in recurrent models, Convolutional Neural Networks (CNNs), and Binary Neural Networks (BNNs). In recurrent models such as Long Short-Term Memory (LSTM) networks, large weight matrices are transformed into smaller, manageable tensor cores, reducing the number of parameters and the computational load. For CNNs, TT decomposition targets the convolutional layers, transforming convolutional filters into tensor cores to preserve spatial structure while significantly reducing parameters. In BNNs, TT decomposition is combined with weight binarization, resulting in extremely compact models that retain the essential information needed for accurate predictions even with minimal computational power and memory.
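
To make the core-factorization step concrete, here is a minimal TT-SVD sketch: a weight tensor is split into three-way cores by successive truncated SVDs. The fixed maximum rank and the toy reshaping of a 256 x 256 weight matrix into a 4-way tensor are illustrative assumptions, not the exact construction used in the talk.

```python
import numpy as np

def tt_svd(tensor, max_rank):
    """Decompose a d-way tensor into TT cores via sequential truncated SVD.

    Each core has shape (r_prev, mode_size, r_next); contracting all cores
    over the rank indices approximates the input tensor.
    """
    shape = tensor.shape
    d = len(shape)
    cores, r_prev = [], 1
    unfolding = tensor.reshape(shape[0], -1)
    for k in range(d - 1):
        U, S, Vt = np.linalg.svd(unfolding, full_matrices=False)
        r = min(max_rank, S.size)
        cores.append(U[:, :r].reshape(r_prev, shape[k], r))
        unfolding = (S[:r, None] * Vt[:r]).reshape(r * shape[k + 1], -1)
        r_prev = r
    cores.append(unfolding.reshape(r_prev, shape[-1], 1))
    return cores

# Toy example: reshape a 256 x 256 weight matrix into a 4-way tensor
# (16 x 16 x 16 x 16) and compress it with TT rank at most 8.
W = np.random.default_rng(0).normal(size=(256, 256))
cores = tt_svd(W.reshape(16, 16, 16, 16), max_rank=8)
n_params = sum(c.size for c in cores)   # far fewer than 256 * 256 entries
```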

The primary aim of this presentation is to explore the theoretical foundations and practical applications of TT decomposition, demonstrating how this technique optimizes various machine learning models. The findings suggest that TT decomposition can greatly enhance model efficiency and scalability, making it a valuable tool for a wide range of applications.

DeepParticle: learning multiscale PDEs with data generated from interacting particle methods

Speaker: 

Jack Xin

Institution: 

UCI

Time: 

Tuesday, April 30, 2024 - 3:00pm to 4:00pm

Location: 

440R

Multiscale time-dependent partial differential equations (PDEs) are challenging to compute by traditional mesh-based methods, especially when their solutions develop large gradients or concentrations at unknown locations. Particle methods, based on microscopic aspects of the PDEs, are mesh-free and self-adaptive, yet still expensive when a long-time or highly resolved computation is necessary.

We present DeepParticle, an approach integrating deep learning, optimal transport (OT), and interacting particle (IP) methods, to speed up the generation and prediction of PDE dynamics. We illustrate it through two case studies on transport in fluid flows with chaotic streamlines:

1) large-time front speeds of the Fisher-Kolmogorov-Petrovsky-Piskunov (FKPP) equation;

2) the Keller-Segel (KS) chemotaxis system modeling bacterial evolution in the presence of a chemical attractant.

Analysis of the FKPP equation reduces the problem to the computation of the principal eigenvalue of an advection-diffusion operator. A normalized Feynman-Kac representation makes possible a genetic IP algorithm that evolves an initially uniform particle distribution to a large-time invariant measure from which front speeds are extracted. The invariant measure is parameterized by a physical parameter (the Peclet number). We train a lightweight deep neural network with local and global skip connections to learn this family of invariant measures. The training data come from IP computations in three dimensions at a few sample Peclet numbers.
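
For illustration only, here is a minimal sketch of the kind of lightweight network with local and global skip connections described above: a small map that transports input particles conditioned on a scalar Peclet number. The layer widths, depth, and conditioning scheme are assumptions, not the speaker's architecture.

```python
import torch
import torch.nn as nn

class ParticleTransportNet(nn.Module):
    """Small MLP with local (residual) and global skip connections.

    Maps a 3D particle position x and a scalar parameter pe (Peclet number)
    to a transported position; applying it to uniform samples is meant to
    produce samples of the parameterized invariant measure.
    """
    def __init__(self, dim=3, width=64, n_blocks=3):
        super().__init__()
        self.inp = nn.Linear(dim + 1, width)
        self.blocks = nn.ModuleList(
            nn.Sequential(nn.Linear(width, width), nn.ReLU(),
                          nn.Linear(width, width))
            for _ in range(n_blocks)
        )
        self.out = nn.Linear(width, dim)

    def forward(self, x, pe):
        h = torch.relu(self.inp(torch.cat([x, pe], dim=-1)))
        for block in self.blocks:
            h = h + block(h)          # local skip connection
        return x + self.out(h)        # global skip: predict a displacement

# Toy usage: transport 1024 uniform particles at Peclet number 5.0.
net = ParticleTransportNet()
x0 = torch.rand(1024, 3)
pe = torch.full((1024, 1), 5.0)
x1 = net(x0, pe)
```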

The training objective being minimized is a discrete Wasserstein distance from OT theory. The trained network predicts a more concentrated invariant measure at a larger Peclet number and also serves as a warm start to accelerate IP computation. The KS system is formulated as a McKean-Vlasov equation (the macroscopic limit) of a stochastic IP system. The DeepParticle framework extends to this setting and learns to generate various finite-time bacterial aggregation patterns.
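
As a concrete reference for the training objective, here is a minimal sketch of one standard way to evaluate a discrete 2-Wasserstein distance between two equal-size particle samples, by solving an optimal assignment over pairwise squared distances. The use of scipy's linear_sum_assignment and uniform weights are assumptions for illustration, not the exact loss used in DeepParticle.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def wasserstein2(x, y):
    """Discrete 2-Wasserstein distance between two equal-size samples.

    With uniform weights on n points each, the optimal coupling is a
    permutation, so the distance reduces to an optimal assignment over
    squared pairwise costs.
    """
    C = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)   # squared distances
    rows, cols = linear_sum_assignment(C)
    return np.sqrt(C[rows, cols].mean())

# Toy usage: distance between two 3D particle clouds of 200 points each.
rng = np.random.default_rng(0)
a = rng.normal(size=(200, 3))
b = rng.normal(loc=0.5, size=(200, 3))
print(wasserstein2(a, b))
```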
