[1]. Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang| Residual Attention Network for Image Classification
[2]. Dzmitry Bahdanau, KyungHyun Cho Yoshua Bengio| NEURAL MACHINE TRANSLATION BY JOINTLY LEARNING TO ALIGN AND TRANSLATE
[3]. Ilya Loshchilov, Frank Hutter| SGDR: STOCHASTIC GRADIENT DESCENT WITH WARM RESTARTS
[4]. Min Lin, Qiang Chen, Shuicheng Yan| Network In Network
[5]. Rupesh Kumar Srivastava, Klaus Greff, Jurgen Schmidhuber| Training Very Deep Networks
[6]. Rupesh Kumar, Klaus Greff, Jurgen Schmidhuber| Highway Networks
[7]. Ziyu Xu, Chen Dan, Justin Khim, Pradeep Ravikumar| Class-Weighted Classification: Trade-offs and Robust Approaches
[8]. Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard S. Zemel, Yoshua Bengio| Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
[9]. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin| Attention Is All You Need
[10]. Muhammad Imran Razzak, Saeeda Naz and Ahmad Zaib| Deep Learning for Medical Image Processing: Overview, Challenges and Future
[11]. Bhavya Ajani| Automatic Intracranial Brain Segmentation from Computed Tomography Head Images
[12]. Nhan T. Nguyen, Dat Q. Tran, Nghia T. Nguyen, Ha Q. Nguyen| A CNN-LSTM Architecture for Detection of Intracranial Hemorrhage on CT scans