Posts
1411
Following
142
Followers
869
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models. (arXiv:2302.07257v1 [cs.CV])" — Using Large Language Models (LLM) with Computer-Aided Diagnosis (CAD) networks to enhance the output of CAD networks by summarizing and presenting the information in a more understandable format.

Paper: http://arxiv.org/abs/2302.07257

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Overview of our proposed strate…
0
0
0

Fahim Farook

"Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask. (arXiv:2302.07224v1 [cs.CV])" — Using a semantic mask as input to generate photorealistic color images of natural scenes.

Paper: http://arxiv.org/abs/2302.07224

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Given only a single semantic ma…
0
1
0

Fahim Farook

"Universal Guidance for Diffusion Models. (arXiv:2302.07121v1 [cs.CV])" — a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance methods such as segmentation, face recognition, object detection, and classifier signals, without the need to retrain for that specific method.

Paper: http://arxiv.org/abs/2302.07121
Code: https://github.com/arpitbansal297/Universal-Guided-Diffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of diffusion being gui…
0
1
2

Fahim Farook

Now that I've got a working workflow for StableDiffusion image generation again, I can finally get back to the long neglected #DiscWorld novel title series ...

Yesterday's prompt was: "I Shall Wear Midnight"

Not much DiscWorld-iness in the images, but they all were suitably dark to represent midnight, I guess 😛

#AIArt #StableDiffusion #DeepLearning #MachineLearning #CV #AI #DiscWorld
Prompt: “I Shall Wear Midnight”…
Prompt: “I Shall Wear Midnight”…
Prompt: “I Shall Wear Midnight”…
Prompt: “I Shall Wear Midnight”…
1
2
6

Fahim Farook

"Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis. (arXiv:2209.08891v2 [cs.CV] UPDATED)" — An interesting look at how replacing Latin characters with non-Latin (visual) equivalents, generative models reflect cultural stereotypes and biases in their generated images.

Paper: http://arxiv.org/abs/2209.08891

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Example of homoglyph manipulati…
0
1
0

Fahim Farook

"Dark solitons in Bose-Einstein condensates: a dataset for many-body physics research. (arXiv:2205.09114v2 [cond-mat.quant-gas] UPDATED)" — A dataset of over 1.6×10⁴ experimental images of Bose-Einstein condensates containing solitonic excitations to enable machine learning (ML) for many-body physics research.

Paper: http://arxiv.org/abs/2205.09114

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
(a)–(c) Raw data from which dat…
0
0
1

Fahim Farook

"I²SB: Image-to-Image Schrödinger Bridge. (arXiv:2302.05872v1 [cs.CV])" — A new class of conditional diffusion models that directly learn the nonlinear diffusion processes between two given distributions.

Paper: http://arxiv.org/abs/2302.05872

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Outputs of our proposed Image-t…
0
3
2

Fahim Farook

"Heckerthoughts" — A manuscript going through the basic concepts central to AI and Machine Learning where the author claims that some concepts he included are missing from modern ML courses. Conversational style, anecdotal, and not too long at 54 pages. Worth reading …

Paper: https://arxiv.org/abs/2302.05449

#AI #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The “burst” of emails that prom…
0
4
1

Fahim Farook

"Stochastic Surprisal: An inferential measurement of Free Energy in Neural Networks. (arXiv:2302.05776v1 [cs.LG])" — A framework that allows for action during inference in supervised neural networks.

Paper: http://arxiv.org/abs/2302.05776
Code: https://github.com/olivesgatech/Stochastic-Surprisal

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Stochastic surprisal answers co…
0
1
1

Fahim Farook

"Adding Conditional Control to Text-to-Image Diffusion Models. (arXiv:2302.05543v1 [cs.CV])" — A method to control pretrained large diffusion models to support additional input conditions which can be used to augment existing generative models such as StableDiffusion by enabling conditional inputs like edge maps, segmentation maps, keypoints, etc.

Paper: http://arxiv.org/abs/2302.05543
Code: https://github.com/lllyasviel/ControlNet

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Control Stable Diffusion with C…
0
2
0

Fahim Farook

"MaskSketch: Unpaired Structure-guided Masked Image Generation. (arXiv:2302.05496v1 [cs.CV])" — An image generation method that uses a guiding sketch to generate realistic images that match the structure of the sketch.

Paper: http://arxiv.org/abs/2302.05496

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Given an input sketch and its c…
0
1
0

Fahim Farook

"Element-Wise Attention Layers: an option for optimization. (arXiv:2302.05488v1 [cs.LG])" — A new method of attention mechanism which uses matrices multiplications and has shown 92% accuracy and a 97% reduction in parameters for the Fashion MNIST dataset.

Paper: http://arxiv.org/abs/2302.05488

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The outputs of the attention mo…
0
1
0

Fahim Farook

I haven't been posting any #StableDiffusion images since I was busy coding and writing and doing other stuff ...

But I've been generating images. These are from prompts ranging from "a cross between an egg and a rabbit" to "a creature with a large white egg as its body, white rabbit ears, a whiskered face, and rabbit paws and legs, blue sky with clouds, lots of trees in the background" ...

I was trying to get one image to match a story I was writing but I had to try various prompts to get something I was even happy with. And in the end, I didn't use any of these images 😛

#AIArt #StableDiffusion #DeepLearning #MachineLearning #CV #AI
Prompt: “a cross between an egg…
Prompt: “a creature with a larg…
Prompt: “a creature with a larg…
Prompt: “a creature with a larg…
0
0
2

Fahim Farook

"Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training. (arXiv:2210.07688v2 [cs.CL] UPDATED)" — A study of the object hallucination problem in large-scale Vision-Language Pre-trained (VLP) models from multiple aspects.

Paper: http://arxiv.org/abs/2210.07688
Code: No code in linked repo

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Comparison of image captioning …
0
1
0

Fahim Farook

"Dive into Deep Learning. (arXiv:2106.11342v4 [cs.LG] UPDATED)" — An open-source book on Deep Learning based on Jupyter Notebooks so that it contains interactive examples. Freely available and well-worth checking out.

Paper: http://arxiv.org/abs/2106.11342
Code: https://github.com/d2l-ai/d2l-en
Book: https://d2l.ai/

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Image of book cover for Dive in…
0
5
8

Fahim Farook

"Scaling Vision Transformers to 22 Billion Parameters. (arXiv:2302.05442v1 [cs.CV])" — A recipe for highly efficient and stable training of a 22B-parameter Vision Transformers (ViT) overtaking the previously known 4B parameter model.

Paper: http://arxiv.org/abs/2302.05442

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Dense prediction from frozen Vi…
0
2
1

Fahim Farook

"Rumor Classification through a Multimodal Fusion Framework and Ensemble Learning. (arXiv:2302.05289v1 [cs.CV])" — A set of advanced image features that are inspired from the field of image quality assessment, to assess message veracIty in social networks, which exploits all message features by exploring various machine learning models.

Paper: http://arxiv.org/abs/2302.05289

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
An overview of the proposed rum…
0
2
1

Fahim Farook

"Archaeological Sites Detection with a Human-AI Collaboration Workflow. (arXiv:2302.05286v1 [cs.CV])" — Using pre-trained semantic segmentation deep learning models to detect archaeological sites within the Mesopotamian floodplains environment.

Paper: http://arxiv.org/abs/2302.05286
Code: https://github.com/mister-magpie/tell_segmentation

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Investigation area. Orange dots…
0
8
8

Fahim Farook

"CEN-HDR: Computationally Efficient neural Network for real-time High Dynamic Range imaging. (arXiv:2302.05213v1 [cs.CV])" — A new computationally efficient neural network based on a light attention mechanism and sub-pixel convolution operations for real-time HDR imaging.

Paper: http://arxiv.org/abs/2302.05213
Code: https://github.com/steven-tel/CEN-HDR

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Qualitative comparison of the p…
0
2
1

Fahim Farook

"DOMINO: Domain-aware Loss for Deep Learning Calibration. (arXiv:2302.05142v1 [cs.CV])" — A domain-aware loss function to calibrate deep learning models so as to avoid the potential dangers of uncalibrated models in medical imaging.

Paper: http://arxiv.org/abs/2302.05142
Code: https://github.com/lab-smile/DOMINO

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Confusion matrices on testing s…
0
3
0
Show older