The Farooks

Fahim Farook

@f Admin Moderator

Posts

1639

Following

139

Followers

885

I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel, which is complete but in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Books

https://shop.farook.org

Apps

https://pinkzombiestudios.com

Blog

https://write.farook.org

Pronouns

He/Him

Fahim Farook

f

That’s it for papers today — 9 papers boosted from the cs.CV category on arXiv.org out of a total of 93 papers.

#AI #CV #NewPapers #DeepLearning #MachineLearning

Fahim Farook

f

"Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation. (arXiv:2206.07771v2 [cs.CV] UPDATED)" — Synthesis of multiple types of content such as dance-to-music or text-to-image using a new diffusion mechanism, at fewer steps.

Paper: http://arxiv.org/abs/2206.07771
Code: https://github.com/l-yezhu/cdcd

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of the input (left col…

Fahim Farook

f

"Write and Paint: Generative Vision-Language Models are Unified Modal Learners. (arXiv:2206.07699v2 [cs.CV] UPDATED)" — A unified model based on training a model to write and paint concurrently.

Paper: http://arxiv.org/abs/2206.07699

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Illustration of the overall arc…

Fahim Farook

f

"Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance. (arXiv:2204.05543v2 [cs.CV] UPDATED)" — Reliable outpainting using depth-guidance.

Paper: http://arxiv.org/abs/2204.05543

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
A comparison of image outpainti…

Fahim Farook

f

"Text-driven Visual Synthesis with Latent Diffusion Prior. (arXiv:2302.08510v1 [cs.CV])" — Using diffusion models as the generic driver for diverse image generation tasks such as text-to3D, image editing, and StyleGAN adaptation.

Paper: http://arxiv.org/abs/2302.08510

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Applications of the proposed la…

Fahim Farook

repeated

Loren

loren@flipping.rocks

i boosted this site earlier but i think it deserves more explanation. this is a search engine for images that shows you the copyright status for the image right there front and center and you can filter for them. for instance i grabbed this copyright 0 image of an echidna from there. and it even generated a citation for me: "Echidna on the move" by CazzJj is marked with CC0 1.0. i think this is a really easy way for getting a reference image while ensuring the person who took the image consents to its use! https://openverse.org/

Fahim Farook

f

"3D-aware Conditional Image Synthesis. (arXiv:2302.08509v1 [cs.CV])" — Using a 2 input such as a segmentation or edge map to generate photo-realistic images from different perspectives/viewpoints.

Paper: http://arxiv.org/abs/2302.08509
Code: https://github.com/dunbar12138/pix2pix3D

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Given a 2D label map as input, …

Fahim Farook

f

"T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models. (arXiv:2302.08453v1 [cs.CV])" — Controlling text-to-image diffusion models in a more granular fashion by using special adapters to provide extra guidance.

Paper: http://arxiv.org/abs/2302.08453
Code: https://github.com/TencentARC/T2I-Adapter

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
We propose T2I-Adapter, a simpl…

Fahim Farook

f

"MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation. (arXiv:2302.08113v1 [cs.CV])" — Controlling diffusion-based image generation so that you can specify image components, component placement etc. without any further fine-tuning.

Paper: http://arxiv.org/abs/2302.08113
Code: https://github.com/omerbt/MultiDiffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
MultiDiffusion enables flexible…

Fahim Farook

f

"\`A-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting. (arXiv:2302.07994v1 [cs.LG])" — Having multiple subsets of data trained on specific prompts and being able to compose the final model based on the prompts you select.

Paper: http://arxiv.org/abs/2302.07994

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
A-la-carte Learning and APT. Gi…

Fahim Farook

f

"PRedItOR: Text Guided Image Editing with Diffusion Prior. (arXiv:2302.07979v1 [cs.CV])" — Structure preserving, text guided image editing using diffusion models without needing a base prompt, fine-tuning of models etc.

Paper: http://arxiv.org/abs/2302.07979

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of text guided image e…

Fahim Farook

f

Reply to @memetzgz@fosstodon.org

@memetzgz If I wanted to read all of them, yes 😛 But I just skim through the summaries to find the ones that look interesting … At the moment, I’ve found 5 interesting papers and have whittled down the 93 to 39 remaining papers …

Fahim Farook

repeated

Jill Hamilton (inactive)

omgshutupjill@mstdn.ca

Edited 3 years ago

I saw this little guy yesterday and I can't stop thinking about him. So round. So pink. #borbs #borb

(It's an Australian pink robin and the photos are not mine.)

Credits:
1. © Deepak Karra http://instagram.com/ravi_arora/
2. © Ravi Arora http://instagram.com/ravi_arora/
3. © Ambika Angela Bone http://instagram.com/ambikangela/
4. © Tim J. Hopwood http://flickr.com/photos/timjhopwood/

Fahim Farook

f

A total of 93 papers in the cs.CV category on arXiv.org today — 61 new, 32 updated going into the weekend …

So yesterday’s low paper count was definitely not a slowdown 😛

#AI #CV #NewPapers #DeepLearning #MachineLearning

Fahim Farook

f

Updated Akkoma on our server to the latest. New graphs, yay!! Hopefully nothing broke, though. I’m always scared of stuff breaking after an update because I’m always in a rush …

Fahim Farook

repeated

sdw

sdw@mastodon.social

I ended up testing our new Neural Telephoto feature by shooting with iPhone SE for a while. I loved the shots I got out of it w/ native RAW and the extra reach of the virtual 2×.

Death Valley / Owens Valley, California
iPhone SE, 1× / 2× (Neural Telephoto), @halide RAW

Fahim Farook

repeated

Dave

Dave2022@mastodonapp.uk

I was quietly minding my own business taking photos of Bullfinches (small/far away) in a tree when this Robin came and perched right in front of my camera and demanded to have his picture taken. How could I refuse?

#AngryBirds 🤣

#Birds #BirdWatching #Twitching #Nature #BirdsOfMastodon #Bird #Photo #Photography #Robin

Fahim Farook

repeated

Jean Gautier

Neoresistant@mamot.fr

#FleurisTonFil
The passionflower, a large and beautiful flower
#photographie #photography

Fahim Farook

repeated

Jason Coward

drumshaman@mas.to

The single-digit temperatures and a bit of sunshine created an ethereal ground fog this morning.

📆 Feb 16, 2023
📷 1/1600 s at f/11, ISO 180, 170mm

#photography #fog #winter #cold #nature #landscape #tree #nikon

Fahim Farook

repeated

Jim Fan

drjimfan@bird.makeup

The Adam optimizer is at the heart of modern AI. Researchers have been trying to dethrone Adam for years.

How about we ask a machine to do a better job? @googleai uses evolution to discover a simpler and more efficient algorithm with remarkable features.

It’s just 8 lines of code: 🧵
