Posts
1606
Following
138
Followers
882
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"SIViDet: Salient Image for Efficient Weaponized Violence Detection. (arXiv:2207.12850v4 [cs.CV] UPDATED)" — A new dataset that contains videos depicting weaponized violence, non-weaponized violence, and non-violent events; and a proposal for a novel data-centric method that arranges video frames into salient images while minimizing information loss for comfortable inference by SOTA image classifiers.

Paper: http://arxiv.org/abs/2207.12850
Code: https://github.com/Ti-Oluwanimi/Violence_Detection

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Salient Image: A sequence of vi…
0
1
0

Fahim Farook

"BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning. (arXiv:2206.08657v3 [cs.CV] UPDATED)" — A proposal for multiple bridge layers that build a connection between the top layers of uni-modal encoders and each layer of the cross-modal encoder.

Paper: http://arxiv.org/abs/2206.08657
Code: https://github.com/microsoft/BridgeTower

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
(a) – (d) are four categories o…
0
1
0

Fahim Farook

"Text-To-4D Dynamic Scene Generation. (arXiv:2301.11280v1 [cs.CV])" — A method for generating three-dimensional dynamic scenes from text descriptions which uses a 4D dynamic Neural Radiance Field (NeRF), which is optimized for scene appearance, density, and motion consistency by querying a Text-to-Video (T2V) diffusion-based model.

Paper: http://arxiv.org/abs/2301.11280

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Samples generated by MAV3D alon…
0
1
1

Fahim Farook

"BiBench: Benchmarking and Analyzing Network Binarization. (arXiv:2301.11233v1 [cs.CV])" — A rigorously designed benchmark with in-depth analysis for network binarization where they scrutinize the requirements of binarization in the actual production and define evaluation tracks and metrics for a comprehensive investigation.

Paper: http://arxiv.org/abs/2301.11233

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Evaluation tracks of BiBench. O…
0
1
0

Fahim Farook

"Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models. (arXiv:2301.11189v1 [eess.IV])" — A non-binary discriminator that is conditioned on quantized local image representations obtained via VQ-VAE autoencoders, for lossy image compression.

Paper: http://arxiv.org/abs/2301.11189

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Comparison of distortion vs. st…
0
0
0

Fahim Farook

"Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring. (arXiv:2301.11116v1 [cs.CV])" — A look at temporal modeling in the context of image-to-video knowledge transferring, which is the key point for extending image-text pretrained models to the video domain.

Paper: http://arxiv.org/abs/2301.11116

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
(a) Illustration of temporal mo…
0
1
0

Fahim Farook

"Explaining Visual Biases as Words by Generating Captions. (arXiv:2301.11104v1 [cs.LG])" — Diagnosing the potential biases in image classifiers by leveraging two types (generative and discriminative) of pre-trained vision-language models to describe the visual bias as a word.

Paper: http://arxiv.org/abs/2301.11104

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Concept of the proposed bias-to…
0
1
0

Fahim Farook

"Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities. (arXiv:2301.11100v1 [cs.CV])" — An exploration of the extent to which zero-shot vision-language models exhibit gender bias for different vision tasks.

Paper: http://arxiv.org/abs/2301.11100

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
(a) The average precision (AP) …
0
0
0

Fahim Farook

"simple diffusion: End-to-end diffusion for high resolution images. (arXiv:2301.11093v1 [cs.CV])" — Improve denoising diffusion for high resolution images while keeping the model as simple as possible and obtaining performance comparable to the latent diffusion-based approaches?

Paper: http://arxiv.org/abs/2301.11093

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
A dslr photo of a frog wearing…
0
1
1

Another of my fluid abstractions. 'Awestruck.' Think I should just call them my restlessness series since i only draw them when I'm stuck wanting to do 5 things at once and not being able to prioritize.

0
2
0

I went to two exhibits this week; one by Sidne Teske, and another by Phyllis Shafer; my second visit before the show closes tomorrow. So fortunate to have opportunities to see amazing artwork in person. Always make me want to hurry home and get to work in my studio. Artwork is hard work.

0
2
0

Fahim Farook

"Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning. (arXiv:2301.11063v1 [cs.CV])" — A method to reduce the parameters and FLOPs for computational efficiency in deep learning models by introducing accuracy and efficiency coefficients to control the trade-off between the accuracy of the network and its computing efficiency.

Paper: http://arxiv.org/abs/2301.11063

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Reward at varying Accuracies an…
0
1
0

Fahim Farook

"Explore the Power of Dropout on Few-shot Learning. (arXiv:2301.11015v1 [cs.CV])" — An exploration of the power of the dropout regularization technique on few-shot learning and provide some insights about how to use it.

Paper: http://arxiv.org/abs/2301.11015

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The generalization power of tra…
0
1
0

Fahim Farook

"On the Importance of Noise Scheduling for Diffusion Models. (arXiv:2301.10972v1 [cs.CV])" — A study of the effect of noise scheduling strategies for denoising diffusion generative models which finds that the noise scheduling is crucial for performance.

Paper: http://arxiv.org/abs/2301.10972

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Random samples generated by our…
0
1
0

Fahim Farook

"ITstyler: Image-optimized Text-based Style Transfer. (arXiv:2301.10916v1 [cs.CV])" — A data-efficient text-based style transfer method that does not require optimization at the inference stage where the text input is converted to the style space of the pre-trained VGG network to realize a more effective style swap.

Paper: http://arxiv.org/abs/2301.10916

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview stylization results of…
0
1
0

Fahim Farook

"Distilling Cognitive Backdoor Patterns within an Image. (arXiv:2301.10908v1 [cs.LG])" — A simple method to distill and detect backdoor patterns within an image by extracting the "minimal essence" from an input image responsible for the model's prediction.

Paper: http://arxiv.org/abs/2301.10908
Code: https://github.com/HanxunH/CognitiveDistillation

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
On the left, First row: a clean…
0
1
0

Fahim Farook

A total of 55 papers in the cs.CV category on arXiv.org today — 44 new, 11 updated.

And off we go to see the wonderful wiz ... erm ... wrong one ... off we go to read the papers 😛

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

I told a friend that her skin looked fabulous. She said “thanks, I grew it myself.“ This is exactly the kind of person I strive diligently to keep around me.

0
2
0

It’s a beautiful daily routine tiptoeing down to the to supplement the native frog ’ diet in the early mornings.

Luckily, the everyday of fresh for lunch includes their favorite (baby spinach). Many other greens are growing, but regular hand picking allows spells for regrowth, and we only plant what will actually grow in this environment. Some are volunteers. Today’s harvest in midsummer coastal includes:

Spinach, silver beet, kale, rocket, cress, sorrel, Asian greens, spring onion, basil, parsley, sage, rosemary, thyme, coriander, mint, nasturtium, leek, green beans, tomato, capsicum, baby eggplant, radish, edible flowers, together with a handful of wild greens grown mostly to attract beneficial insects with their flowers.

Orchard caterpillars have appeared on the citrus, to increase the neighborhood butterfly population.

0
2
0

Fahim Farook

I’m extremely pleased with Tusker so far. I can’t come up with specifics as to why it’s such a great experience, but after having tried multiple Mastodon clients (on macOS, web, and iOS) this is the first one that I’m so happy with.

I’ve done some tweaks to the UI so far and there is a bunch of other things that I (or the wife) want tweaked, but even without the pending tweaks, I’m happier with the overall experience of Tusker than I’ve been with any of the other clients I’ve tried 🙂

I just need to get some of these issues fixed (like not being able to use the emoji picker in the post editor) and I’ll be golden!
0
0
2
Show older