Zikai Sun's Homepage

Biography

Hey there! I'm Zikai Sun. I'm now a PhD student at the Chinese University of Hong Kong working with Professor Thierry Blu. Before that, I achieved Bachelor degree at South China University of Technology in Information Engineering (the Honors Class).

Publications

Zikai Sun, Thierry Blu. "Empowering Networks With Scale and Rotation Equivariance Using A Similarity Convolution", Accepted at International Conference Learning Representation 2023 (ICLR). PDF

The translational equivariant nature of CNN is a reason for its great success in the field of computer vision. However, networks do not enjoy more general equivariance properties such as rotation or scaling. This limits the generalization performance of the network. In this paper, we devise a method that provides networks with equivariance with respect to translation, rotation, and scaling simultaneously. We define a convolution-like operation and ensure equivariance based on our proposed scalable Fourier-Argand representation. The method has similar efficiency as a traditional network and hardly introduces any additional learnable parameters, since it does not face the computational issue often occurs in group-convolution operator. We verified the quality of our approach in the image classification task, demonstrating the robustness and the generalization ability to both scaled and rotated inputs.

Zikai Sun, Zihan Zhang, Thierry Blu. "An Algebraic Optimization Approach to Image Registration", Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP) Bordeaux, France, October 16--19, 2022. PDF

High-speed is an essential requirement for many applications of image registration. However, existing methods are usually time-consuming due to the difficulty of the task. In this paper, different from usual feature-based ideas, we convert the matching problem into an algebraic optimization task. By solving a series of quadratic optimization equations, the underlying deformation (rotation, scaling and shift) between image pairs can be retrieved. This process is extremely fast and can be performed in real-time. Experiments show that our method can achieve good performance at a much lower computation cost. When used to initialize our earlier parametric Local All-Pass (LAP) registration algorithm, the results obtained improve significantly over the state of the art.

Zikai Sun, Thierry Blu “A Nonlinear Steerable Complex Wavelet Decomposition of Images” Proceedings of the Forty-seventh IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Singapore, May 22--27, 2022. PDF

Signal and image representations that are steerable are essential to capture efficiently directional features. However, those that are successful at achieving directional selectivity usually use too many subbands, resulting in low computational efficiency. In this paper, we propose a two-dimensional nonlinear transform that uses only two subbands to achieve rotation invariance property, and enjoys a mirror reconstruction making it similar to a “tight frame”. The two-subband structure is merged into a unique, concise, complex-valued subband that approximates a Wirtinger gradient which is naturally steerable. Complete steerability, though, is achieved by utilizing the Fourier-Argand representation, which provides a steerable filter able to estimate the amplitude and direction of image features, even in the presence of very high noise. We demonstrate the efficiency of the representation by comparing how it performs in wavelet-based denoising algorithms.

Zikai Sun, Dezhi Peng, Zirui Cai, Zirong Chen, Lianwen Jin Scale Mapping and Dynamic Re-detecting in Dense Head Detection, In 2018 25th IEEE International Conference on Image Processing (ICIP) (pp. 1902 - 1906). IEEE. PDF

Convolutional neural networks (CNNs) have demonstrated a strong ability to extract semantics from images during object detection; however, the extracted semantics are typically have strong scale priors for a specific circumstance. In this paper, we investigate the influence of head scale and contextual information, and then propose a scale-invariant method for head detection. Our method can dynamically detect heads depending on the complexity of the image. It uses an extra feature map to represent the scale information of the spatial relationship, and then uses this feature map for auxiliary detection. Particularly, we exploit several new techniques, including contextual information, scale invariance, and hard example mining. We evaluated our method on three head datasets and achieved state-of-the-art results for the Brainwash dataset, HollywoodHeads dataset, and SCUT-HEAD dataset

Dezhi Peng*, Zikai Sun*, Zirong Chen, Zirui Cai, Lele Xie, Lianwen Jin, Detecting Heads using Feature Refine Net and Cascaded Multi-scale Architecture, In 2018 24th International Conference on Pattern Recognition (ICPR) (pp. 2528 - 2533). IEEE. (* indicate co-first author) PDF

This paper presents a method that can accurately detect heads especially small heads under the indoor scene. To achieve this, we propose a novel method, Feature Refine Net (FRN), and a cascaded multi-scale architecture. FRN exploits the multi-scale hierarchical features created by deep convolu- tional neural networks. The proposed channel weighting method enables FRN to make use of features alternatively and effectively. To improve the performance of small head detection, we propose a cascaded multi-scale architecture which has two detectors. One called global detector is responsible for detecting large objects and acquiring the global distribution information. The other called local detector is designed for small objects detection and makes use of the information provided by global detector. Due to the lack of head detection datasets, we have collected and labeled a new large dataset named SCUT-HEAD which includes 4405 images with 111251 heads annotated. Experiments show that our method has achieved state-of-the-art performance on SCUT-HEAD.

Stuart Middlemiss, David M Roberts, James Grimshaw, Joshua M Edwards, Zikai Sun, Kevin D Whitley, Thierry Blu, Henrik Strahl, Seamus Holden, Molecular motor tug-of-war regulates elongasome cell wall synthesis dynamics in Bacillus subtilis. PDF

Most rod-shaped bacteria elongate by inserting new cell wall material into the inner surface of the cell sidewall. This is primarily performed by a highly conserved protein complex, the elongasome, which moves processively around the cell circumference and inserts long glycan strands that act as barrel-hoop-like reinforcing structures, thereby giving rise to a rod-shaped cell. However, it remains unclear how elongasome synthesis dynamics and termination events are regulated to determine the length of these critical cell-reinforcing structures. To address this, we developed a method to track individual elongasome complexes around the entire circumference of Bacillus subtilis cells for minutes-long periods using single molecule fluorescence microscopy. We found that the B. subtilis elongasome is highly processive and that processive synthesis events are frequent terminated by rapid reversal or extended pauses. We found that cellular levels of RodA regulate elongasome processivity, reversal and pausing. Our single molecule data, together with stochastic simulations, show that elongasome dynamics and processivity are regulated by molecular motor tug-of-war competition between several, likely two, oppositely oriented peptidoglycan synthesis complexes bound to the MreB filament. Our data, thus, demonstrate that molecular motor tug-of-war is a key regulator of elongasome dynamics in B. subtilis, which likely also regulates the cell shape via modulation of elongasome processivity.

Awards

National Scholarship, (Highest national wide scholarship for chinese undergraduate), 2018

National First Prize, China Undergraduate Mathematical Contest in Modeling (#Top 1/162 teams), 2017

The First Prize, SCUT Mathematical Modeling Competition (#Top 1/225 teams), 2017

Principal Investigator, National Undergraduate Technological Innovation Project, 2017

Honorable Mention, Mathematical Contest in Modeling & Interdisciplinary Contest In Modeling, 2017

Third Prize, SCUT Electronic Design Competition, 2016

Experiences

Microsoft Research Asia

Research Intern, 2019.4 - 2019.8

- Doing research on the topic of Human Pose Estimate and Object detection task.

SIAT, Chinese Academic of Science

Research Intern, 2018.12 - 2019.4

- Participate the project “TIA / Minor Stroke recurrent risk prediction” cooperate with the Department of Medicine and Therapeutics at the Chinese University of Hong Kong, responsible for the algorithm part.

HCII-lab, South China Univ. of Tech.

Research Assistant, 2017.3 - 2018.5

- As the team leader, held the project “A method of counting number in classroom based on deep learning”, which obtained CNY$10,000 research funding.
- Developed a software with team members that can detect and count heads in surveillance videos or image on Qt platform.

Rutgers University

International Student, 2017.7 - 2017.8

- Robot control based on Raspberry Pi and sensor intelligence algorithm. - Studied two course: “Introduction to 21st Century Engineering” and “Introduction to Robotics”, both achieved A grades.

Projects

Facial expression DIY Software

- An android-based and weixin-based application combined the method of Generative Adversarial Networks.

Facial DIY software demo:
- An android-based and weixin-based application combined the method of Generative Adversarial Networks. ...details

Hexapod Simulation Robot

- A Hexapod Simulation Robot made from scratch that realized Tripod gait walk algorithm

Underconstruction...
...details

Search engine website

- A whole-network search engine that realized by python and Flask framework.