Posts
1411
Following
142
Followers
868
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"Explore the Power of Dropout on Few-shot Learning. (arXiv:2301.11015v1 [cs.CV])" — An exploration of the power of the dropout regularization technique on few-shot learning and provide some insights about how to use it.

Paper: http://arxiv.org/abs/2301.11015

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The generalization power of tra…
0
1
0

Fahim Farook

"On the Importance of Noise Scheduling for Diffusion Models. (arXiv:2301.10972v1 [cs.CV])" — A study of the effect of noise scheduling strategies for denoising diffusion generative models which finds that the noise scheduling is crucial for performance.

Paper: http://arxiv.org/abs/2301.10972

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Random samples generated by our…
0
1
0

Fahim Farook

"ITstyler: Image-optimized Text-based Style Transfer. (arXiv:2301.10916v1 [cs.CV])" — A data-efficient text-based style transfer method that does not require optimization at the inference stage where the text input is converted to the style space of the pre-trained VGG network to realize a more effective style swap.

Paper: http://arxiv.org/abs/2301.10916

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview stylization results of…
0
1
0

Fahim Farook

"Distilling Cognitive Backdoor Patterns within an Image. (arXiv:2301.10908v1 [cs.LG])" — A simple method to distill and detect backdoor patterns within an image by extracting the "minimal essence" from an input image responsible for the model's prediction.

Paper: http://arxiv.org/abs/2301.10908
Code: https://github.com/HanxunH/CognitiveDistillation

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
On the left, First row: a clean…
0
1
0

Fahim Farook

"Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners. (arXiv:2108.13161v7 [cs.CL] CROSS LISTED)" — A novel pluggable, extensible, and efficient large-scale pre-trained language model approach which can convert small language models into better few-shot learners without any prompt engineering.

Paper: http://arxiv.org/abs/2108.13161
Code: https://github.com/zjunlp/DART

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The architecture of DifferentiA…
0
3
2

Fahim Farook

"VN-Transformer: Rotation-Equivariant Attention for Vector Neurons. (arXiv:2206.04176v3 [cs.CV] UPDATED)" — A novel "VN-Transformer" architecture to address several shortcomings of the current Vector Neuron (VN) models.

Paper: http://arxiv.org/abs/2206.04176

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
VN-Transformer (“early fusion”)…
0
1
0

Fahim Farook

"Towards Arbitrary Text-driven Image Manipulation via Space Alignment. (arXiv:2301.10670v1 [cs.CV])" — A new Text-driven image Manipulation framework via Space Alignment (TMSA) which aims to align the same semantic regions in CLIP and StyleGAN spaces so that the text input can be directly accessed into the StyleGAN space and be used to find the semantic shift according to the text description.

Paper: http://arxiv.org/abs/2301.10670

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview of image manipulation …
0
1
0

Fahim Farook

"Toward Realistic Evaluation of Deep Active Learning Algorithms in Image Classification. (arXiv:2301.10625v1 [cs.CV])" — An Active Learning (AL) benchmarking suite and a set of extensive experiments on five datasets shedding light on the questions: when and how to apply AL.

Paper: http://arxiv.org/abs/2301.10625

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The Active Learning loop featur…
0
2
0

Fahim Farook

"Variation-Aware Semantic Image Synthesis. (arXiv:2301.10551v1 [cs.CV])" — A new requirement for semantic image synthesis (SIS) to achieve more photorealistic images, variation-aware, which consists of inter- and intra-class variation.

Paper: http://arxiv.org/abs/2301.10551

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of class-level mode co…
0
1
0

Fahim Farook

"DEJA VU: Continual Model Generalization For Unseen Domains. (arXiv:2301.10418v1 [cs.LG])" — A framework that focuses on improving models' target domain generalization (TDG) capability, while also achieving effective target domain adaptation (TDA) capability right after training on certain domains and forgetting alleviation (FA) capability on past domains.

Paper: http://arxiv.org/abs/2301.10418

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview of applying our framew…
0
3
1

Fahim Farook

"Local Feature Extraction from Salient Regions by Feature Map Transformation. (arXiv:2301.10413v1 [cs.CV])" — A framework that robustly extracts and describes salient local features regardless of changing light and viewpoints by suppressing illumination variations and encouraging structural information to ignore the noise from light and to focus on edges.

Paper: http://arxiv.org/abs/2301.10413

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Proposed framework. Network con…
0
1
0

Fahim Farook

"A Fast Feature Point Matching Algorithm Based on IMU Sensor. (arXiv:2301.10293v1 [cs.CV])" — An algorithm using the inertial measurement unit (IMU) to optimize the efficiency of image feature point matching, in order to reduce the time consumed when matching feature points in simultaneous localization and mapping (SLAM).

Paper: http://arxiv.org/abs/2301.10293

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Two adjacent frames. The left i…
0
1
1

Fahim Farook

Yesterday's #StableDiffusion prompt was: "Unseen Academicals".

The results were the most diverse and furthest away from anything to do with the original #DiscWorld title that I've gotten so far 🙂 But on the other hand, the prompt is rather abstract ...

I am reaching the end of the DiscWorld title list and it might be time to take a break from DiscWorld and try something else for a few days?

#AIArt #DeepLearning #MachineLearning #CV #AI
Stable Diffusion prompt: "Unsee…
Stable Diffusion prompt: "Unsee…
Stable Diffusion prompt: "Unsee…
Stable Diffusion prompt: "Unsee…
1
1
4

Fahim Farook

"Intelligent Painter: Picture Composition With Resampling Diffusion Model. (arXiv:2210.17106v2 [cs.CV] UPDATED)" — An intelligent painter that generate a person's imaginary scene in one go, given explicit hints as to the objects to appear in the final image, their placement in the image etc.

Paper: http://arxiv.org/abs/2210.17106
Code: https://github.com/vinesmsuic/ipainter-diffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The architecture of the propose…
0
2
1

Fahim Farook

"Provably Convergent Plug & Play Linearized ADMM, applied to Deblurring Spatially Varying Kernels. (arXiv:2210.10605v3 [cs.CV] UPDATED)" — A plug & play framework based on linearized ADMM that allows you to bypass the computation of intractable proximal operators.

Paper: http://arxiv.org/abs/2210.10605
Code: https://github.com/claroche-r/pnp_ladmm

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Example of sample from the test…
0
1
1

Fahim Farook

"Self-Supervised Learning Through Efference Copies. (arXiv:2210.09224v2 [cs.LG] UPDATED)" — A paper which shows that the commonly used self-supervised learning approach of transforming each training datapoint into a pair of views, using the knowledge of this pairing as a positive (i.e. non-contrastive) self-supervisory sign, and potentially opposing it to unrelated, (i.e. contrastive) negative examples is an incomplete implementation of a concept from neuroscience, the Efference Copy (EC).

Paper: http://arxiv.org/abs/2210.09224

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Efference copy (EC). A) Sensory…
0
0
0

Fahim Farook

"Continual Transformers: Redundancy-Free Attention for Online Inference. (arXiv:2201.06268v3 [cs.AI] UPDATED)" — A novel formulations of the Scaled Dot-Product Attention, which enable Transformers to perform efficient online token-by-token inference on a continual input stream.

Paper: http://arxiv.org/abs/2201.06268
Code: https://github.com/lukashedegaard/continual-transformers

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Multi-block Continual Transform…
0
2
3

Fahim Farook

"Model soups to increase inference without increasing compute time. (arXiv:2301.10092v1 [cs.CV])" — A comparison of Model Soups performances on three different models (ResNet, ViT and EfficientNet) using three Soup Recipes (Greedy Soup Sorted, Greedy Soup Random and Uniform soup) and a new Soup Recipe called Pruned Soup with better performance.

Paper: http://arxiv.org/abs/2301.10092
Code: https://github.com/milo-sobral/modelsoup

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
A table showing the accuracies …
0
1
0

Fahim Farook

"Side Eye: Characterizing the Limits of POV Acoustic Eavesdropping from Smartphone Cameras with Rolling Shutters and Movable Lenses. (arXiv:2301.10056v1 [cs.CR])" — A paper discussing the limits of acoustic information leakage caused by structure-borne sound that perturbs the POV of smartphone cameras.

Paper: http://arxiv.org/abs/2301.10056

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Illustration of the POV optical…
0
0
0

Fahim Farook

"Progressive Meta-Pooling Learning for Lightweight Image Classification Model. (arXiv:2301.10038v1 [cs.CV])" — A framework to make the receptive field learnable for a lightweight network, unlike the conventional efficient learning methods which ignore the role of the receptive field in neural network design, by using parameterized pooling-based operations.

Paper: http://arxiv.org/abs/2301.10038

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview of the proposed Meta-P…
0
1
1
Show older