.js-id-AI-Computing-Cyberinfrastructure
(arXiv2024) FengWu-GHR | Learning the Kilometer-scale Medium-range Global Weather Forecasting
FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather Forecasting The first data-driven global weather forecasting model running at the 0.09◦ horizontal resolution. FengWu-GHR introduces a novel approach that opens the door for operating ML-based high-resolution forecasts by inheriting prior knowledge from a pretrained low-resolution model.
(arXiv2024) FengWu-GHR | Learning the Kilometer-scale Medium-range Global Weather Forecasting
(NeuIPS2023) SwapPrompt | Test-Time Prompt Adaptation for Vision-Language Models
In this paper, we propose SwapPrompt, a novel framework that can effectively leverage the self-supervised contrastive learning to facilitate the test-time prompt adaptation. SwapPrompt employs a dual prompts paradigm, i.e., an online prompt and a target prompt that averaged from the online prompt to retain historical information.
(NeuIPS2023) SwapPrompt | Test-Time Prompt Adaptation for Vision-Language Models
(NeuIPS2023) Towards Test-Time Refusals via Concept Negation
PROTORE works by incorporating CLIP’s language-contrastive knowledge to identify the prototype of negative concepts, extract the negative features from outputs using the prototype as a prompt, and further refine the attention maps by retrieving negative features.
(NeuIPS2023) Towards Test-Time Refusals via Concept Negation
(EuroSys2024) Warming Serverless ML Inference via Inter-function Model Transformation
Serverless ML inference is an emerging cloud computing paradigm for low-cost, easy-to-manage inference services. In serverless ML inference, each call is executed in a container; however, the cold start of containers results in long inference delays.
(EuroSys2024) Warming Serverless ML Inference via Inter-function Model Transformation
(INFOCOM2024) An Elastic Transformer Serving System for Foundation Model via Token Adaptation
Transformer model empowered architectures have become a pillar of cloud services that keeps reshaping our society. However, the dynamic query loads and heterogeneous user requirements severely challenge current transformer serving systems, which rely on pre-training multiple variants of a foundation model to accommodate varying service demands.
(INFOCOM2024) An Elastic Transformer Serving System for Foundation Model via Token Adaptation
(KDD2023) Investigating Trojan Attacks on Pre-trained Language Model-powered Database Middleware
The recent success of pre-trained language models (PLMs) such as BERT has resulted in the development of various beneficial database middlewares, including natural language query interfaces and entity matching. This shift has been greatly facilitated by the extensive external knowledge of PLMs.
(KDD2023) Investigating Trojan Attacks on Pre-trained Language Model-powered Database Middleware
Intelligence-Endogenous Management Platform for Computing and Network Convergence
Massive emerging applications are driving demand for the ubiquitous deployment of computing power today. This trend not only spurs the recent popularity of the Computing and Network Convergence (CNC), but also introduces an urgent need for the intelligentization of a management platform to coordinate changing resources and tasks in the CNC.
Intelligence-Endogenous Management Platform for Computing and Network Convergence
A Unified TinyML System for Multi-modal Edge Intelligence and Real-time Visual Perception.
Our research focuses on the software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization and hardware-level arithmetic acceleration.
A Unified TinyML System for Multi-modal Edge Intelligence and Real-time Visual Perception.
Adaptive Quantization-aware Training and Model Compression.
Our research focuses on the software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization and hardware-level arithmetic acceleration.
Adaptive Quantization-aware Training and Model Compression.
A Unified Contrastive Representation Learner for Cross-modal Federated Learning Systems.
Our research focuses on the software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization and hardware-level arithmetic acceleration.
A Unified Contrastive Representation Learner for Cross-modal Federated Learning Systems.
Progressive Network Sparsification and Latent Feature Compression for Scalable Collaborative Learning.
Our research focuses on the software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization and hardware-level arithmetic acceleration.
Progressive Network Sparsification and Latent Feature Compression for Scalable Collaborative Learning.
Masked Autoencoders for Occlusion-aware Visual Learners
Our research focuses on the software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization and hardware-level arithmetic acceleration.
Masked Autoencoders for Occlusion-aware Visual Learners
Flexible Patch Skip for Real-time Visual Perception.
Our research focuses on the software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization and hardware-level arithmetic acceleration.
Flexible Patch Skip for Real-time Visual Perception.
Efficient Federated Learning Framework on Heterogeneous Environment
Federated learning (FL) has been proposed as a promising solution for future AI applications with strong privacy protection. It enables distributed computing nodes to collaboratively train models without exposing their own data.
Efficient Federated Learning Framework on Heterogeneous Environment
Next generation blockchain system
Our team aims at the next-generation blockchain system with scalability, security, privacy, and intelligence and our proposed architecture is composed of 6 layers as above. In the following, the details of these 6 layers will be explained from top to bottom.
Next generation blockchain system
Radiation-free Spine Reconstruction and Posture Analysis Techniques with 3D Imaging
Scoliosis is a sideways curvature of the spine that occurs most often during thegrowth spurt just before puberty. According to the survey and statistics of China Child Development Center, more than 20% teens have scoliosis.
Radiation-free Spine Reconstruction and Posture Analysis Techniques with 3D Imaging
Edge AI in smart city
Research Overview Our team aims to design promising solutions for future AI applications based on edge intelligent technologies, which can empower construction, public health, environment, transportation and other industries, and promote the upgrading of urban intelligence.
Edge AI in smart city
Edge Application Layer in Blockchain-empowered Edge Learning
Blockchain-empowered edge learning is a novel distributed learning architecture to dispense with a dedicated server in traditional distributed learning and provide trustworthy training for edge devices.
Edge Application Layer in Blockchain-empowered Edge Learning
Heterogeneous Data & Resource Constraints- Batch Size Adaptation
To tackle non-IID data challenge in FL, we consider to design a new method to improve training efficiency of each client from the perspective of whole training process.
Heterogeneous Data \& Resource Constraints- Batch Size Adaptation
Heterogeneous Data & Expensive Communication- Layer-wised Aggregation
We design a novel pFL training framework dubbed Layer-wised Personalized Federated learning (pFedLA) that can discern the importance of each layer from different clients, and thus is able to optimize the personalized model aggregation for clients with heterogeneous data.
Heterogeneous Data \& Expensive Communication- Layer-wised Aggregation
Semantic Query and Index Layer in Semantic Blockchain Database
Blockchain database is a new direction that constructs index on top of blockchain to provide rich query functionalities. The existing works are either insecure because the query process separates from the blockchain consensus, or inscalable because all the data needs to be stored in the block. Therefore, we propose an authenticated semantic database layer for blockchains.
Semantic Query and Index Layer in Semantic Blockchain Database
Heterogeneous Hardware & Data- Parameterized Knowledge Transfer
Most existing pFL methods rely on model parameters aggregation at the server side, which require all models to have the same structure and size. Such constraints would prevent status quo pFL methods from further application in practical scenarios, where clients are often willing to own unique models, i.e., with customized neural architectures to adapt to heterogeneous capacities in computation, communication and storage space, etc. We seek to develop a novel training framework that can accommodate heterogeneous model structures for each client and achieve personalized knowledge transfer in each FL training round.
Heterogeneous Hardware \& Data- Parameterized Knowledge Transfer
Intelligent Consensus Layer in Learning-Driven Dynamic Architecture
Most existing blockchain systems adopt a static policy that cannot efciently deal with the dynamic environment in the blockchain system, i.e., joining and leaving of nodes, and malicious attack. Therefore, we propose a novel dynamic sharding-based blockchain framework to achieve a good balance between performance and security without compromising scalability under a dynamic environment.
Intelligent Consensus Layer in Learning-Driven Dynamic Architecture
Lack of participants- Incentive Mechanism Design for Federated Learning
A few of works have designed incentive mechanisms for FL, but these mechanisms only consider myopia optimization on resource consumption, which results in the lack of learning algorithm performance guarantee and long-term sustainability. We propose Chiron, an incentive-driven long-term mechanism for edge learning based on hierarchical deep reinforcement learning.
Lack of participants- Incentive Mechanism Design for Federated Learning
Layered Sharding Architecture for Blockchain
Most existing blockchain systems adopt a static policy that cannot efciently deal with the dynamic environment in the blockchain system, i.e., joining and leaving of nodes, and malicious attack. Therefore, we propose a novel dynamic sharding-based blockchain framework to achieve a good balance between performance and security without compromising scalability under a dynamic environment.
Layered Sharding Architecture for Blockchain
Sustainable Off-chain Payment Channel Network
Payment channel network (PCN) is the most promising off-chain technologies to support massive micro payments for blockchain. The technology has been deployed in a number of blockchains including Bitcoin and Ethereum.
Sustainable Off-chain Payment Channel Network
Hybrid On-/Off-Chain Distributed Storage
Personal data produced from widely emerged cyberspace activities are expected to promote information dissemination and engagement, or even make business intelligence more powerful.
Hybrid On-/Off-Chain Distributed Storage
Anti-Occlusion Human Pose Estimation for Scoliosis Rehabilitation
Physiotherapeutic scoliosis-specific exercises (PSSE) have been proved to be effective in scoliosis rehabilitation. PSSE consist of a program of curve-specific exercise protocols which are individually adapted to a patients’ curve site, magnitude, and clinical characteristics.
Anti-Occlusion Human Pose Estimation for Scoliosis Rehabilitation
Next Generation AI Video Analytics Detection System on Driving Behavior and Mental Factors
Video-based abnormal driving behavior and mental detection is becoming more and more popular. The key goal is to ensure the safety of drivers and passengers in the vehicle, and it is an essential step to realize autonomous driving at this stage.
Next Generation AI Video Analytics Detection System on Driving Behavior and Mental Factors
Emergency Risk Management in Smart City
With the emergence and drastic improvement of mobile devices (e.g., phones, tablets, drones, and autonomous vehicles), we are now witnessing an exciting revolution of the digital city.
Emergency Risk Management in Smart City
New Architectures and Methodologies for High Performance Sharding Blockchain
Blockchain draws tremendous attention from academia and industry, since it can provide distributed ledgers with data transparency, integrity, and immutability to untrusted parties for various decentralized applications.
New Architectures and Methodologies for High Performance Sharding Blockchain
Federated Learning in Resourced Constrained Mobile Edge Network
Federated learning (FL) has been proposed as a promising solution for future AI applications with strong privacy protection. It enables distributed computing nodes to collaboratively train models without exposing their own data.
Federated Learning in Resourced Constrained Mobile Edge Network