Longformer: The Long-Document Transformer
B S Ashwin
An engineer by training, I rigorously track the cool things happening in the technology world. A strategy consultant by profession, I am a deep believer in the power of AI and ML to change the world!