I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel, which is complete but in the editing stage. I wrote my first novel over 20 years ago but then didn't write much until now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

Yesterday's Pratchett novel title was: "The Shepherd's Crown"

And that's the last of the #DiscWorld titles 😞 Sure, there are a few others left like "Nation", "Dodger", and the "Bromeliad" stuff (not to mention "Johnny"), but those don't really count as much here. I was reluctant to do this one since it feels (almost) like reading the last DiscWorld novel (and I haven't read anything much since then ...)

But time moves on and I guess we must too?

#AIArt #StableDiffusion #DeepLearning #MachineLearning #CV #AI
Prompt: “The Shepherd's Crown”.…

Fahim Farook

"Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation. (arXiv:2206.07771v2 [cs.CV] UPDATED)" — Synthesis of multiple types of content such as dance-to-music or text-to-image using a new diffusion mechanism, at fewer steps.

Paper: http://arxiv.org/abs/2206.07771
Code: https://github.com/l-yezhu/cdcd

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of the input (left col…

Fahim Farook

"Write and Paint: Generative Vision-Language Models are Unified Modal Learners. (arXiv:2206.07699v2 [cs.CV] UPDATED)" — A unified model based on training a model to write and paint concurrently.

Paper: http://arxiv.org/abs/2206.07699

#AI #CV #NewPaper #DeepLearning #MachineLearning

Illustration of the overall arc…

Fahim Farook

"Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance. (arXiv:2204.05543v2 [cs.CV] UPDATED)" — Reliable outpainting using depth-guidance.

Paper: http://arxiv.org/abs/2204.05543

#AI #CV #NewPaper #DeepLearning #MachineLearning

A comparison of image outpainti…

Fahim Farook

"Text-driven Visual Synthesis with Latent Diffusion Prior. (arXiv:2302.08510v1 [cs.CV])" — Using diffusion models as the generic driver for diverse image generation tasks such as text-to3D, image editing, and StyleGAN adaptation.

Paper: http://arxiv.org/abs/2302.08510

#AI #CV #NewPaper #DeepLearning #MachineLearning

Applications of the proposed la…

Fahim Farook

"3D-aware Conditional Image Synthesis. (arXiv:2302.08509v1 [cs.CV])" — Using a 2 input such as a segmentation or edge map to generate photo-realistic images from different perspectives/viewpoints.

Paper: http://arxiv.org/abs/2302.08509
Code: https://github.com/dunbar12138/pix2pix3D

#AI #CV #NewPaper #DeepLearning #MachineLearning

Given a 2D label map as input, …

Fahim Farook

"T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models. (arXiv:2302.08453v1 [cs.CV])" — Controlling text-to-image diffusion models in a more granular fashion by using special adapters to provide extra guidance.

Paper: http://arxiv.org/abs/2302.08453
Code: https://github.com/TencentARC/T2I-Adapter

#AI #CV #NewPaper #DeepLearning #MachineLearning

We propose T2I-Adapter, a simpl…

Fahim Farook

"MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation. (arXiv:2302.08113v1 [cs.CV])" — Controlling diffusion-based image generation so that you can specify image components, component placement etc. without any further fine-tuning.

Paper: http://arxiv.org/abs/2302.08113
Code: https://github.com/omerbt/MultiDiffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

MultiDiffusion enables flexible…

Fahim Farook

"\`A-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting. (arXiv:2302.07994v1 [cs.LG])" — Having multiple subsets of data trained on specific prompts and being able to compose the final model based on the prompts you select.

Paper: http://arxiv.org/abs/2302.07994

#AI #CV #NewPaper #DeepLearning #MachineLearning

A-la-carte Learning and APT. Gi…

Fahim Farook

"PRedItOR: Text Guided Image Editing with Diffusion Prior. (arXiv:2302.07979v1 [cs.CV])" — Structure preserving, text guided image editing using diffusion models without needing a base prompt, fine-tuning of models etc.

Paper: http://arxiv.org/abs/2302.07979

#AI #CV #NewPaper #DeepLearning #MachineLearning

Examples of text guided image e…

Fahim Farook

Yesterday's Pratchett novel title prompt was: "Raising Steam".

Here's the thing about the prompt — I generated images on macOS initially and I was happy with the images I was getting since I was getting strange stuff. Nothing really to do with the prompt possibly, but all sorts of weird and wonderful landscapes 🙂

Then I switched to Windows for generation and suddenly all I'd get were trains or some sort of steam engine. Not a lot of variety ... No matter how many models I tried 😛

I've selected a mixed set from both sides for fair representation but I feel as if this needs more exploration ...

#AIArt #StableDiffusion #DeepLearning #MachineLearning #CV #AI #DiscWorld
Prompt: “Raising Steam”. A dark…
Prompt: “Raising Steam”. Some s…
Prompt: “Raising Steam”. A stra…
Prompt: “Raising Steam”. A dark…

Fahim Farook

"Forward Pass: On the Security Implications of Email Forwarding Mechanism and Policy" — How email forwarding can create security vulnerabilities and and allow spoofing.

Paper: https://arxiv.org/abs/2302.07287

#AI #NewPaper #Security
Example message with a FROM hea…

Fahim Farook

"AI Chat Assistants can Improve Conversations about Divisive Topics" — A study looking at how Large Language Models can improve conversations on divisive topics by making the participants feel understood.

Paper: https://arxiv.org/abs/2302.07268

#AI #NewPaper #DeepLearning #MachineLearning #Language #HumanComputerInteraction
Treated Conversation Flow: Resp…

Fahim Farook

"CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context" — A tool that uses a reader’s publishing, reading, and saving history to provide personalised context to citations in papers that they’re reading.

Paper: https://arxiv.org/abs/2302.07302

#NewPaper #HumanComputerInteraction
CiteSee augments inline citatio…

Fahim Farook

"Stitchable Neural Networks. (arXiv:2302.06586v2 [cs.LG] UPDATED)" — A way to combine different pretrained models to combine models of varying complexity and performance.

Paper: http://arxiv.org/abs/2302.06586

#AI #CV #NewPaper #DeepLearning #MachineLearning

Compared with previous scalable…

Fahim Farook

"Learning When to Say "I Don't Know". (arXiv:2209.04944v2 [cs.CV] UPDATED)" — A method to teach learning systems when they don't know something, or at least to identify areas of uncertainty. Perhaps this should be tried with ChatGPT and Bing to mitigate all the gaslighting? 😛

Paper: http://arxiv.org/abs/2209.04944
Code: https://github.com/osu-cvl/learning-idk

#AI #CV #NewPaper #DeepLearning #MachineLearning

t-SNE plots of logits from a we…

Fahim Farook

"SoK: Anti-Facial Recognition Technology. (arXiv:2112.04558v2 [cs.CR] UPDATED)" — An analysis of the currently available Anti-Facial Recognition (AFR) research and the pros and cons of the different approaches.

Paper: http://arxiv.org/abs/2112.04558

#AI #CV #NewPaper #DeepLearning #MachineLearning

The workflow of how facial reco…

Fahim Farook

"Team Triple-Check at Factify 2: Parameter-Efficient Large Foundation Models with Feature Representations for Multi-Modal Fact Verification. (arXiv:2302.07740v1 [cs.CL])" — Verifying facts across different modes (text and images) and types (claim and document).

Paper: http://arxiv.org/abs/2302.07740
Code: https://github.com/wwweiwei/Pre-CoFactv2-AAAI-2023

#AI #CV #NewPaper #DeepLearning #MachineLearning

An example of each category in …

Fahim Farook

"Video Probabilistic Diffusion Models in Projected Latent Space. (arXiv:2302.07685v1 [cs.CV])" — Generating high-resolution and coherent video using diffusion models.

Paper: http://arxiv.org/abs/2302.07685

#AI #CV #NewPaper #DeepLearning #MachineLearning

256×256 resolution, 128 frame v…

Fahim Farook

Yesterday's Pratchett novel title prompt was: "Snuff"

I didn't really expect much from that one since there isn't a lot to work with there. But I still got some interesting images 🙂

These are from multiple models and so the styles are a little different across images. But still, I thought the end results were interesting ...

#AIArt #StableDiffusion #DeepLearning #MachineLearning #CV #AI #DiscWorld
Prompt: “Snuff”. Colourful pict…
Prompt: “Snuff”. A huge creatur…
Prompt: “Snuff”. A colourful im…
Prompt: “Snuff”. A colourful bu…