Posts
1368
Following
141
Followers
868
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"Computer Vision for a Camel-Vehicle Collision Mitigation System. (arXiv:2301.09339v1 [cs.CV])" — Testing different object detection models on the task of detecting camels on the road since in Saudi Arabia, due to the size of camels, camel-vehicle collisions result in a 25% fatality rate.

Paper: http://arxiv.org/abs/2301.09339

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Example images from the dataset…
0
1
0

Fahim Farook

"Apples and Oranges? Assessing Image Quality over Content Recognition. (arXiv:2301.09190v1 [cs.CV])" — An investigation of whether image recognition and quality assessment can be performed in a multitask learning manner.

Paper: http://arxiv.org/abs/2301.09190

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Architecture of the proposed IQ…
0
2
0

Fahim Farook

"Raw or Cooked? Object Detection on RAW Images. (arXiv:2301.08965v1 [cs.CV])" — An investigation of the hypothesis that the intermediate representation of visually pleasing images is sub-optimal for downstream computer vision tasks compared to the RAW image representation.

Paper: http://arxiv.org/abs/2301.08965

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Three qualitative examples from…
0
3
2

Fahim Farook

"Domain-agnostic and Multi-level Evaluation of Generative Models" — A framework for multi-level performance evaluation of generative models which could be employed across different domains (images, text, graphs, molecules, etc.)

Paper: https://arxiv.org/abs/2301.08750

#AI #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview of the MPEGO framework…
0
1
0

Fahim Farook

"Improving Deep Regression with Ordinal Entropy. (arXiv:2301.08915v1 [cs.CV])" — An investigation of the fact that in computer vision, formulating regression problems as a classification task often yields better performance, and shows that classification, with the cross-entropy loss, outperforms regression with a mean squared error loss in its ability to learn high-entropy feature representations.

Paper: http://arxiv.org/abs/2301.08915

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Feature learning of regression …
0
1
2

Fahim Farook

"A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement. (arXiv:2301.08880v1 [cs.CV])" — A large-scale and high-quality film style dataset to facilitate film-based image stylization research. The dataset includes three different film types and more than 5000 in-the-wild high resolution images.

Paper: http://arxiv.org/abs/2301.08880

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
This figure contains input imag…
0
1
0

Fahim Farook

"Regeneration Learning: A Learning Paradigm for Data Generation. (arXiv:2301.08846v1 [cs.LG])" — A learning paradigm for data generation (e.g., text generation, speech recognition, speech synthesis, music composition, image generation, and video generation) which first generates Y' (an abstraction/representation of the target data, Y) from the source data, X, and then generates Y from Y'.

Paper: http://arxiv.org/abs/2301.08846

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Three types of tasks in machine…
0
1
0

Fahim Farook

"In-situ Water quality monitoring in Oil and Gas operations. (arXiv:2301.08800v1 [cs.CV])" — a model designed to enable users to determine contamination levels in water bodies with weak reflectance patterns such as small ponds based on satellite images.

Paper: http://arxiv.org/abs/2301.08800

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Image “a”shows data sample from…
0
1
0

Fahim Farook

"Visual Semantic Relatedness Dataset for Image Captioning. (arXiv:2301.08784v1 [cs.CL])" — A textual visual context dataset for captioning, in which the publicly available dataset COCO Captions has been extended with information about the scene (such as objects in the image).

Paper: http://arxiv.org/abs/2301.08784

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of our proposed COCO b…
0
1
0

Fahim Farook

This particular #StableDiffusion prompt based on Terry Pratchett novel titles was in the works for a few days — I just wasn't sure about the results since most of them were fairly similar ...

The prompt? "Wintersmith"

Predictably, most of the results were people in winter-wear. I just didn't like the monotony and hence the inclusion of the fox from what looks like the cover of a box of pencils 😛

#AIArt #DeepLearning #MachineLearning #CV #AI #DiscWorld
Stable Diffusion prompt: "Winte…
Stable Diffusion prompt: "Winte…
Stable Diffusion prompt: "Winte…
Stable Diffusion prompt: "Winte…
0
0
1

Fahim Farook

"Model Complexity-Accuracy Trade-off for a Convolutional Neural Network. (arXiv:1705.03338v1 [cs.CV] CROSS LISTED)" — A study of the model complexity versus accuracy trade-off on MNSIT dataset, providing a concrete framework for handling such a problem, given the worst case accuracy that a system can tolerate.

Paper: http://arxiv.org/abs/1705.03338

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
A graph plotting Accuracy vs Mo…
0
1
0

Fahim Farook

"MemeTector: Enforcing deep focus for meme detection. (arXiv:2205.13268v2 [cs.CV] UPDATED)" — A methodology that utilizes the visual part of image memes as instances of the regular image class and the initial image memes as instances of the image meme class to force the model to concentrate on the critical parts that characterize an image meme.

Paper: http://arxiv.org/abs/2205.13268
Code: https://github.com/mever-team/memetector

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Example image meme vs. a regula…
0
1
0

Fahim Farook

"Learning Sequential Latent Variable Models from Multimodal Time Series Data. (arXiv:2204.10419v2 [cs.LG] UPDATED)" — A self-supervised generative modelling framework to jointly learn a probabilistic latent state representation of multimodal data and the respective dynamics to improve prediction and representation quality.

Paper: http://arxiv.org/abs/2204.10419
Code: https://github.com/utiasstars/visual-haptic-dynamics

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
We learn a sequential latent va…
0
2
1

Fahim Farook

"REx: Data-Free Residual Quantization Error Expansion. (arXiv:2203.14645v2 [cs.CV] UPDATED)" — A quantization method that leverages residual error expansion, along with group sparsity and an ensemble approximation for better parallelization.

Paper: http://arxiv.org/abs/2203.14645

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Illustration of the proposed me…
0
1
0

Fahim Farook

"Novel-View Acoustic Synthesis. (arXiv:2301.08730v1 [cs.CV])" — Given the sight and sound observed at a source viewpoint, synthesizing the *sound* of that scene from an unseen target viewpoint using a neural rendering approach.

Paper: http://arxiv.org/abs/2301.08730

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Novel-view acoustic synthesis t…
0
2
0

Fahim Farook

"Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences. (arXiv:2301.08571v1 [cs.CL])" — A new image-grounded dataset for improving visual story generation due to the fact that existing image sequence collections do not have coherent plots behind them.

Paper: http://arxiv.org/abs/2301.08571

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Comparison of Visual Writing Pr…
0
2
3

Fahim Farook

"When Source-Free Domain Adaptation Meets Label Propagation. (arXiv:2301.08413v1 [cs.CV])" — An approach that tries to achieve efficient feature clustering from the perspective of label propagation by dividing the target data into inner and outlier samples based on the adaptive threshold of the learning state, and applying a customized learning strategy to best fits the data property.

Paper: http://arxiv.org/abs/2301.08413

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
A toy illustration of target fe…
0
1
0

Fahim Farook

"Open-Set Likelihood Maximization for Few-Shot Learning. (arXiv:2301.08390v1 [cs.CV])" — A generalization of the maximum likelihood principle, in which latent scores down-weighing the influence of potential outliers are introduced alongside the usual parametric model. This implementation can be applied on top of any pre-trained model seamlessly.

Paper: http://arxiv.org/abs/2301.08390

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Intuition behind OSLO. Standard…
0
1
2
Show older