Posts
1576
Following
139
Followers
881
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos. (arXiv:2109.08275v2 [cs.MM] UPDATED)" — A geo-tagged photo based tourist attraction recommendation system which utilizes the visual contents of photos and interaction behavior data to obtain the final embeddings of users and tourist attractions, which are then used to predict the visit probabilities.

Paper: http://arxiv.org/abs/2109.08275
Code: https://github.com/revaludo/MEAL

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
An illustration of the multi-le…
0
0
0

Fahim Farook

"BiAdam: Fast Adaptive Bilevel Optimization Methods. (arXiv:2106.11396v3 [math.OC] UPDATED)" — A novel fast adaptive bilevel framework to solve stochastic bilevel optimization problems that the outer problem is possibly nonconvex and the inner problem is strongly convex.

Paper: http://arxiv.org/abs/2106.11396

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The basic idea of the convergen…
0
0
0

Fahim Farook

"Sparse Oblique Decision Trees: A Tool to Understand and Manipulate Neural Net Features. (arXiv:2104.02922v2 [cs.LG] UPDATED)" — An effort to understanding which of the internal features computed by the neural net are responsible for a particular class, by mimicking part of the neural net with an oblique decision tree having sparse weight vectors at the decision nodes.

Paper: http://arxiv.org/abs/2104.02922

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Mimicking part of a neural net …
0
1
0

Fahim Farook

"Don't Play Favorites: Minority Guidance for Diffusion Models. (arXiv:2301.12334v1 [cs.LG])" — A framework that can make the generation process of the diffusion models focus on the minority samples, which are instances that lie on low-density regions of a data manifold.

Paper: http://arxiv.org/abs/2301.12334

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Diffusion models play favorites…
0
1
0

Fahim Farook

"SEGA: Instructing Diffusion using Semantic Dimensions. (arXiv:2301.12247v1 [cs.CV])" — A semantic guidance method for diffusion models to allow making subtle and extensive edits and changes in composition and style, as well as optimize the overall artistic conception.

Paper: http://arxiv.org/abs/2301.12247

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Semantic control over image gen…
0
1
0

Fahim Farook

"Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits. (arXiv:2301.12168v1 [cs.LG])" — A new training technique based on weighted ensembles of early exits, which aims at exploiting the information in the structure of networks to maximise their performance.

Paper: http://arxiv.org/abs/2301.12168

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Outline of the AEP technique. O…
0
1
0

Fahim Farook

"ClusterFuG: Clustering Fully connected Graphs by Multicut. (arXiv:2301.12159v1 [cs.CV])" — A simpler and potentially better performing graph clustering formulation based on multicut (a.k.a. weighted correlation clustering) on the complete graph.

Paper: http://arxiv.org/abs/2301.12159

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Example illustration of dense m…
0
1
0

Fahim Farook

"Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) Dataset. (arXiv:2301.12073v1 [cs.CV])" — A culturally-aware priming approach for text-to-image synthesis using a small but culturally curated dataset to fight the bias prevalent in giant datasets.

Paper: http://arxiv.org/abs/2301.12073

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Sample images generated for fiv…
0
1
0

Fahim Farook

"Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning. (arXiv:2301.12025v1 [cs.CV])" — A novel self-supervised learning approach that leverages Transformer and CNN simultaneously to overcome the issues with existing self-supervised techniques which have extreme computational requirements and suffer a substantial drop in performance with a reduction in batch size or pretraining epochs.

Paper: http://arxiv.org/abs/2301.12025

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
In our proposed self-supervised…
0
2
1

Fahim Farook

"Improved knowledge distillation by utilizing backward pass knowledge in neural networks. (arXiv:2301.12006v1 [cs.LG])" — Addressing the issue with Knowledge Distillation (KD) where there is no guarantee that the model would match in areas for which you do not have enough training samples, by generating new auxiliary training samples based on extracting knowledge from the backward pass of the teacher in the areas where the student diverges greatly from the teacher.

Paper: http://arxiv.org/abs/2301.12006

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
a) Minimization Step: Using the…
0
1
0

Fahim Farook

"RGB Arabic Alphabets Sign Language Dataset. (arXiv:2301.11932v1 [cs.CV])" — An Arabic Alphabet Sign Language (AASL) dataset comprising of 7,856 raw and fully labelled RGB images of the Arabic sign language alphabets which might be the first such publicly available dataset.

Paper: http://arxiv.org/abs/2301.11932

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Sample images from the dataset
0
1
1

Fahim Farook

A total of 89 papers in the cs.CV category on arXiv.org today — 37 new, 52 updated.

Lots of updates today ... So on to seeing if there's anything interesting in there 🙂

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

The worst thing about having your password stolen is having to rename the dog.

11
6
0

Fahim Farook

I’m thinking of writing a serialised story on here since Akkoma gives me plenty of space for writing. Maybe do something where each instalment is about a 1,000 words, accompany it with StableDiffusion images (or images generated by StableDiffusion that my wife modifies/tweaks — we talked about it, but still figuring out that part …) and do a part every few days or something?

I have a story brewing in my head which is perfect for this but just need to find the time. What with work and the AI papers, weekdays are kind of full and I really need the time for things to percolate properly if I want to write and it’s fun. So maybe write over the weekends and publish over the week?

Still thinking about this … It’s a fun story that both my wife and I have worked on for years, but this is a totally different take on that, and so essentially, a new story 🙂
1
1
1

Fahim Farook

Tusker is really coming into its own for me. I’ve done a bunch of changes to make it work better on macOS as a Catalyst app (mostly to do with remembering placement of the windows and such) and have added in notifications since I forget to check for new posts if there’s no badge on the app …

Still have a few things I’d like to fix but overall, I still like the app and how much easier it is to modify than most of the other Mastodon app code I’ve looked at recently.

So looks as if I’m going with Tusker and will continue to tinker with it. Most of the stuff I’ve added is rough and needs polishing but then I need to find the time to do that, don’t I? 😛
0
0
0

We're looking for a product manager for our digital archive data services at The National Archives (UK). Permanent, full-time (though job share and part-time also considered), hybrid working (2 days a month onsite usually the minimum), senior executive officer grade, £47,000 plus benefits https://www.civilservicejobs.service.gov.uk/csr/jobs.cgi?jcode=1836179

2
2
0

Hey Mastodon, I’m trying to help find someone doing research involving human facial EMG recording. (e.g. levator labii and corrugator signals) You know anyone? Heard any rumors?

0
2
0

Arturo, chilly this morning but making an effort to be adorable as ever.

0
2
0

“[E]verything is a challenge. You have to answer challenge with creative effort. That’s the only thing you can do.”

— Isamu Noguchi, Japanese-American artist, sculptor, designer, landscape architect (1904-1988)

1
2
0

Happy hummingbird zoomies

0
5
1
Show older