Arjun:
Hello Everyone,
Milestone: Dec 20 | Internal Deadline for the paper Draft |
What is Completed?
Pipeline for the End-to-End Implementation.
What is pending?
Design Decisions, primarily use of the columns and how to flip the sensitive attribute.
Writeup for the Experiment, Results, and sections.
I propose two forms of processing for the dataset.
All the columns included in AI360 are used for the datasets (Adult, Compas, Credit)
I would like to sign up for a discussion tomorrow.
Replies:
Dr. Lei:
do you have a first draft of the three sections, Background, Approach, and Experiments?
Arjun:
Hi Group,
Milestone: Dec 20 ( Paper’s first Draft )
I have created on end-to-end implementation of the approach for one dataset ( Adult ) and obtained the results.
I will be getting results for other two datasets ( COMPAS, CREDIT ) by late afternoon to evening.
Summary of Results:
The t_way_samples are obtained from the latent space representation of the training instances.
The ATN was found to be higher when means of bin was used to generate t_way_samples. ( For t = 4, ATN: 0.775 )
Furthermore, using bin_means from discretized columns resulted in better results for the ratio of discriminatory instances compared to ratio of discriminatory instances from the “test set” of the adult income dataset.
I would like to signup for discussion for feedback on the results & further experiments.
Replies:
Dr. Lei:
sounds like good progress. also remember to discuss the issue of ground truth/discretization you mentioned during the meeting. please try to have an agenda and also what you expect to get out of the discussion, in terms of action items and decisions to make.
Arjun:
Hi All,
Milestone: Dec 24, Paper’s Internal Deadline
I have completed the following action items:
Using the mean value of the bin results in a slightly better ATN.
I will present briefly the results if the time permits.
I have an implementation of Conditional VAE using Pytorch. However, I am trying to integrate the implementation with the existing library for better results.
I will complete this last action item and write the approach section as soon as possible for the design decision.
Replies:
Dr. Lei:
great job. now it is time to start on paper writing
Arjun:
Hello Everyone,
Milestone: Dec 24, Paper’s Internal Deadline
I am following up on the action items from the past week:
The target date for the action items listed above was today to keep in track with the milestone; however, I was pretty busy this weekend with TA duties. I will follow up on these action items and their results via Slack, by tomorrow evening.
Replies:
Dr. Lei:
I would suggest you take off “possibly”. mindset matters. things happen only with commitment.
Arjun:
Hello Everyone,
Milestone: Nov 29: Complete Pipeline for current approach along with results
Action Items performed this week:
Project Page: https://ajdahal.github.io/portfolio/
I will be presenting on new idea session today.
Replies:
Dr. Lei:
the project page does not seem to be in a good state. also please summarize the results of your action items, which would be more informative.
Arjun:
Hello Group,
Milestone: Baseline for Fairness Testing | Nov 12 |
Action Item completed this week:
I have been doing some experiments with regards to VAE for generating data points.
Fixed the issue with kl divergence for ‘hours-per-week’ observed in the last meeting.
I have obtained metrics for Reconstruction Loss vs Number of Embedding Dimension.
ATN and Kldivergence for random binning of embedding dimension.
Currently, I am carrying out t-way combination of latent space vectors. I believe I will have results by this afternoon.
Action Items for next week:
I would like to book 15 minutes of time for discussion on the results and the approach.
Arjun:
Hello Group,
Milestone: Nov 12 | Baseline for Fairness Testing |
I am still working on the experiments regarding fairness testing for Variational AutoEncoder.
I will share some results on slack before Nov 12 regarding the process with some metrics.
Replies:
Dr. Lei:
since you are targeting IWCT, whose deadline is fast approaching, it would help you can work out a schedule, with break-down tasks and target dates. the schedule will guide all your efforts toward the paper submission.
Arjun:
Hello Everyone,
Milestone: Baseline for Fairness Testing ( Nov 12 )
I am working on making changes on the existing library of VAE to encorporate the combinatorial sampling in it’s latent space. Furthermore, I am also looking into creating conditional VAE and recording metrics for ATN and KL divergence along the way. I am currently working on the implementation.
I will briefly discuss the findings this Friday.
Arjun:
Hi Group,
Milestone: Baseline for Fairness Testing
I have collected and reviewed the necessary literature for the fairness testing with variational auto-encoder and implemented the necessary code for conditional variational auto-encoder. I would like to signup for a brief discussion tomorrow regarding the review and the results.
Replies:
Dr. Lei:
do you want to put a target date on your milestone?
Arjun:
Hi Group,
Milestone: Finalize the baseline approach Oct 18
I have been working on the following action items since last week:
I have made some progress with the action items above, but haven’t worked till completion primarily due to travel.
I will update my progress everyday till coming Friday to ensure the baseline approach is finalized.
Replies:
Dr. Lei:
try your best to make the milestone
Arjun:
Milestone: Finalize the baseline approach for Fairness testing ( 18 / 08 )
Intermediate Goals:
A. Complete literature search for fairness testing using data samples that reflect realistic changes when sensitive attribute is flipped for fairness testing. ( 11 / 08 )
B. Generation of Data samples conditioned on specific sensitive attribute. ( 11 / 08 )
Action Items completed:
I will present today in the topic of “LLM and it’s application in Tabular Data”.
Replies:
Dr. Lei:
sounds good. i am very interested in hearing what you found out from your literature review first, in your presentation today. also what are the specific issues with conditional VAE?
Arjun:
Thank You Dr. Lei for your feedback. I will be able to complete the literature review by Friday. Furthermore, I will sample few datapoints from the CTVAE and present some results this Friday.
To ensure, I make incremental progress to the task, I will update the status on channel related to my research Wednesday evening.
Arjun:
Hi Group,
I have been following up on action items discussed in the last meeting. I will share some findings regarding LLM producing samples of tabular data for different prompts given to it which will take around ~15 mins. I plan to continue on the action items (sampling in latent space of VAE, generating adversarial examples on top of it ) and present the results in upcoming meetings.
My next milestone would be: Sample instances in latent space of VAE, generate adversarial examples on those samples to produce discriminatory samples and analyze results in each step (Oct 4 ).
Replies:
Dr. Lei:
this is not a good status updates, as I cannot get useful information out of it.
Arjun:
Hi Group,
I have been looking into the literature regarding Fairness and Clustering since the last meeting
( shared in the channel arjun-research ). Creating Baseline approach with the findings from last few weeks is taking little more time than expected.
I intend to present the summary of findings, approach and summary of relevant literature in the upcoming meeting.
Arjun:
Hello All,
Milestone: Present on literature of all existing work on Fairness testing with clustering ( sep 24 )
Action items for the week: I have been following up on action items ( Divide the data into further sub-clusters to get discriminatory and non-discriminatory clusters, Inspect the datapoints that were discriminatory and non-discriminatory in labels as well as prediction ) discussed in last meeting but haven’t been able to make much progress as I amn’t very well. I will present some findings in upcoming meeting on Friday.
Replies:
Dr. Lei:
it is probably time to look back at what you have found out, and then start on developing your own approach. sign up a technical discussion to summarize your findings and propose some possible ideas for your own approach.
Arjun:
Hi All,
Milestone:
Improve upon the baseline approach
I have been investigating the clusters that have all the discriminatory samples.
I have some findings that I would like to share in tomorrow’s meeting ( ~15-20 mins ).
Replies:
Dr. Lei:
try to make a presentation on the major approaches to fairness testing, especially on any existing work based on clustering
© 2023 Jeff Lei's Lab.
Site made with Jekyll. Template from the Allan Lab
We are part of the CSE Department at University of Texas at Arlington.