  • Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences
  • 1068 Xueyuan Avenue, Shenzhen University Town, Shenzhen, P.R.China
  • https://orcid.org/0009-0001-7409-1588

RosalindFok/README.md

✨Rosalind Fok✨


Education

Bachelor of Engineering (Computer Science and Technology): School of Computer Science and Technology, University of Chinese Academy of Sciences. My thesis was advised by Beihong Jin.
Master of Philosophy (Computer Science and Technology): Center for Biomedical Information Technology, Institute of Advanced Computing and Digital Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences. I was advised by Yunpeng Cai.

Current Interests

AI4Science: encoding and decoding of higher cognitive functions of the human brain.
The pathophysiology of mental disorders, which remains incompletely understood.
Graph Embedding and Geometry Learning.

Blogs

Miscellaneous Notes on Research Methods in AI for Science (AI4Sci)
Properties of the Two-Dimensional Discrete Fourier Transform in Digital Image Processing

Published

[1] Yufu HUO, Beihong JIN, Zhaoyi LIAO. Multi-modal information augmented model for micro-video recommendation. Journal of Zhejiang University (Engineering Science), 2024, 58(6): 1142-1152.

Code: https://github.com/RosalindFok/MMa4CTR

Abstract: A multi-modal augmented model for click through rate (MMa4CTR), tailored for micro-video recommendation, was proposed. Multi-modal data derived from user interactions with micro-videos were leveraged to construct embedded user representations and capture diverse user interests across modalities. The aim was to reveal latent semantic commonalities by combining and crossing features across modalities. The overall recommendation performance was boosted via two training strategies: automatic learning rate adjustment and validation interruption. A computationally efficient multi-layer perceptron architecture was employed to address the computational demands of the vast amount of multi-modal data. Performance comparison experiments and hyperparameter sensitivity analyses on the WeChat Video Channel and TikTok datasets demonstrated that MMa4CTR outperformed baseline models, delivering superior recommendation results with minimal computational resources. Additionally, ablation studies on both datasets further validated the significance and efficacy of the micro-video modality cross module, the user multi-modal embedding layer, and the strategies for automatic learning rate adjustment and validation interruption in enhancing recommendation performance.

Key words: recommender system, click through rate, multi modal, micro-video, machine learning

Link: https://www.zjujournals.com/eng/CN/Y2024/V58/I6/1142
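
The abstract names two training strategies, automatic learning rate adjustment and validation interruption, without describing how they work. The sketch below shows one common way such strategies are implemented (lowering the learning rate when validation loss plateaus, and stopping early after prolonged stagnation). It is an illustrative assumption, not the code from the MMa4CTR repository: the class `PlateauController`, the function `run`, and all default values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PlateauController:
    """Hypothetical helper: lowers the learning rate when validation loss
    stalls and signals a stop after prolonged stagnation."""
    lr: float = 1e-3           # current learning rate (assumed starting value)
    factor: float = 0.5        # multiply lr by this on a plateau
    lr_patience: int = 2       # stalled epochs between learning rate cuts
    stop_patience: int = 5     # stalled epochs before training is interrupted
    min_delta: float = 1e-4    # minimum improvement that counts as progress
    best: float = float("inf")
    stalled: int = 0

    def step(self, val_loss: float) -> bool:
        """Record one epoch's validation loss; return True to stop training."""
        if val_loss < self.best - self.min_delta:
            # Real improvement: remember it and reset the stagnation counter.
            self.best = val_loss
            self.stalled = 0
            return False
        self.stalled += 1
        if self.stalled % self.lr_patience == 0:
            self.lr *= self.factor
        return self.stalled >= self.stop_patience


def run(val_losses):
    """Feed a sequence of validation losses; return (last epoch, final lr)."""
    ctrl = PlateauController()
    for epoch, loss in enumerate(val_losses):
        if ctrl.step(loss):
            return epoch, ctrl.lr
    return len(val_losses) - 1, ctrl.lr


if __name__ == "__main__":
    # Made-up loss curve that improves, then plateaus.
    print(run([1.0, 0.8, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7]))
```

In a real training loop, `step` would be called once per epoch with the measured validation loss, and `ctrl.lr` would be copied into the optimizer after each call.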


Pinned

  1. Brainnetome4Depression Public

    Brainnetome: Theory, Methods and Applications. (UCAS 2023)

    Jupyter Notebook 2

  2. SAM4own Public

    Segment Anything Model (SAM) for my own use

    Python

  3. 3D-BrainTumorSegmentat4MIA Public

    Medical Image Analysis (UCAS 2024 Spring)

    Python 1

  4. MMa4CTR Public

    MMa4CTR: a multi-modal information augmented model for micro-video recommendation

    Python 4

  5. Segmentation-of-Mitochondria-and-EndoplasmicReticulum-via-UNet Public

    Biological Image Processing and Informatics (UCAS 2024 Spring)

    Jupyter Notebook