Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

Jupyter notebook markdown generator

Posts

Naacl2021

9 minute read

Published: June 12, 2021

Here are the summary of some of talks from NAACL 2021. So far, I only summarized the following papers, I will be summarizing more. So, I will either append it here or a new blog post. Feel free to check back soon.

ICLR 2021

8 minute read

Published: May 12, 2021

ICLR 2021

Edge Probing

3 minute read

Published: November 06, 2020

In the past couple of years, Transformers has acheived state of art results in a variety of natural language tasks. In order to better understand Transformers and what they are learning in practice, researchers have done layer-wise analysis of Transformer’s hidden states to understand what the Transformer is learning in each layer. A wave of recent work has started to “prob” the state of the art Tranformers to inspect the structure of the network to assess whether there exist localizable regions associated with distinct types of linguistic decisions, both syntactic and semantic information. Researchers examine the hidden states between encoder layers directly and use those hidden states in a linear layer + softmax to predict what kind of information in encoded in each hidden state.

Transformers 2

5 minute read

Published: September 22, 2020

This blog post is the continuation of my previous blog post, Transformers. In my previous blog post, I explained original Transformer paper, BERT, GPT, XLNet, RoBERTa, ALBERT, BART, and AMBER. In this blog post, I will explain MARGE, ConveRT, Generalization through Memorization, AdapterHub, and T5. Images and content used in this blogpost, otherwise mentioned, are all taken from the papers on each model.

Text Summarization

9 minute read

Published: September 14, 2020

Automatic summarization is the process of shortening a set of data computationally, to create a subset (a summary) that represents the most important or relevant information within the original content. Text summarization finds the most informative sentences in a document.

NLP Papers

2 minute read

Published: September 03, 2020

These are the most important transformer papers (in my opinion) that anyone working with Transformers should know. Also, there is a nice summary of Efficient Transformers: A Survey by folks at Google that I highly recommend as well.

Transformers

15 minute read

Published: July 30, 2020

Transformers: This post contains my notes throughout years on different transformers. These notes are very crude and not edited yet (more like my cheat sheets), but I thought to share it anyway. Please let me know if you have any comments or if you find any mistakes. Images used in this blogpost, otherwise mentioned, are all taken from the papers on each model.

Fine Tuning T5 for Summary Generation with PyTorch Lightning

less than 1 minute read

Published: July 26, 2020

My Colab notebook on fine tuning T5 model for summarization task using Trenasformers + PyTorch Lightning

Conditional Random Field

2 minute read

Published: April 28, 2020

In this post, I briefly explain what is conditional random Fields and how they can be used for sequence labeling. CRF is a discriminative model best suited for tasks in which contextual information or state of the neighbors affects the current prediction. CRFs are widely used in named entity recognition, part of speech tagging, gene prediction, noise reduction, and object detection problems.

Knowledge Distillation

3 minute read

Published: January 09, 2020

In this post, I will discuss what is knowledge distillation (also refered as Student-Teacher Learning), what is the intuition behind it, and why it works!

Masked Language Modeling + Fine Tuning for Text Classification with BERT

less than 1 minute read

Published: October 13, 2019

My Colab notebook on Masked Language Modeling (MLM) + Fine Tuning for Text Classification with BERT. In this notebook, you can see how to train a BERT model on your data for MLM task and then fine tune it for text classification. This includes how to encode the data, masked the tokens (similar to here) and train a model from scratch (or train on a pretrained model :). You can load this model and fine tuned it on your labeled data for classification.

Covariate Shift and Dataset Shift

2 minute read

Published: March 18, 2019

2018 Conference on Digital Experimentation (CODE)

less than 1 minute read

Published: November 17, 2018

CODE2018

2018 Conference on Digital Experimentation (CODE)

NAACL 2018, Summary of talks

less than 1 minute read

Published: July 26, 2018

There was so much happening at NAACL; so many interesting works on all sorts of (old and new) NLP problems. Lots of papers focused on how to generalize the models beyond the conditions during training. In addition, there was workshop on “New Forms of Generalization in Deep Learning and Natural Language Processing”. In that workshop, Yejin Choi pointed out that natural language understanding (NLU) does not generalize to natural language generation (NLG). Another focus of the conference/workshops were on dialogue systems and chatbots. Lots of talks focused on using a knowledge graph in chatbots to have deeper conversations without staying on the topic for the whole conversations.

QueryUnderstanding

less than 1 minute read

Published: February 01, 2018

See here

publications

Towards Accessible Integrated Formal Reasoning Environments for Protocol Design

Published in , 2012

[PDF]

Recommended citation: Andrei Lapets, Richard Skowyra, Christine Bassem, Sanaz Bahargam, Azer Bestavros, Assaf Kfoury TP2012.

Software-Defined IDS for Securing Embedded Mobile Devices.

Published in 2013 IEEE High Performance Extreme Computing Conference , 2013

[PDF]

Recommended citation: Sanaz Bahargam, Richard Skowyra, Azer Bestavros HPEC2013.

Using Alloy to Formally Model and Reason About an OpenFlow Network Switch

Published in , 2013

[PDF] [code]

Recommended citation: Saber Mirzaei, Sanaz Bahargam, Richard Skowyra, Assaf Kfoury, Azer Bestavros TP2013.

Personalized Education; Solving a Group Formation and Scheduling Problem for Educational Content

Published in The 8 International Conference on Educational Data Mining, 2015

[PDF] [Short Version]

Recommended citation: Sanaz Bahargam, Dóra Erdos, Azer Bestavros, Evimaria Terzi EDM2015.

Profiling the Different Types of Data Scientists: Which One is Right for You?

Published in Winter Conference on Business Intelligence, 2016, 2016

[PDF] [Short version]

Recommended citation: Sanaz Bahargam, Theodoros Lappas WCBI 2016.

Constrained Coupled Matrix-Tensor Factorization and its Application in Pattern and Topic Detection

Published in IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2018

[PDF] [Short version]

Recommended citation: Sanaz Bahargam, Evangelos Papalexakis ASONAM 2018.

Discovering Time-Evolving Topics of Varying Levels of Difficulty via Constrained Coupled Matrix-Tensor Factorization

Published in 2018 International Conference on Computational Social Science , 2018

Recommended citation: Sanaz Bahargam, Evangelos Papalexakis IC2S2 2018.

The Guided TeamPartitioning Problem: Definition, Complexity, and Algorithm

Published in The 12 International Conference on Educational Data Mining, 2019

[PDF]

Recommended citation: Sanaz Bahargam, Theodoros Lappas, Evimaria Terzi EDM2019.

Team Formation Algorithm for Faultline Minimization

Published in Expert Systems with Applications, 2019

[PDF]

Recommended citation: Sanaz Bahargam, Behzad Golshan, Theodoros Lappas, Evimaria Terzi Expert Systems with Applications 2019.

Sanaz Bahargam

Sitemap

Pages

Posts

Table of Contents

CODE2018

publications

teaching