Posts
1574
Following
138
Followers
878
I'm a bit of an eclectic mess πŸ™‚ I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: πŸ‡±πŸ‡°πŸ‡ΈπŸ‡¦πŸ‡ΊπŸ‡ΈπŸ‡³πŸ‡ΏπŸ‡ΈπŸ‡¬πŸ‡²πŸ‡ΎπŸ‡¦πŸ‡ͺπŸ‡«πŸ‡·πŸ‡ͺπŸ‡ΈπŸ‡΅πŸ‡ΉπŸ‡ΆπŸ‡¦πŸ‡¨πŸ‡¦

Fahim Farook

That’s it for papers today β€” 9 papers boosted from the cs.CV category on arXiv.org out of a total of 93 papers.

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

Fahim Farook

"Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation. (arXiv:2206.07771v2 [cs.CV] UPDATED)" β€” Synthesis of multiple types of content such as dance-to-music or text-to-image using a new diffusion mechanism, at fewer steps.

Paper: http://arxiv.org/abs/2206.07771
Code: https://github.com/l-yezhu/cdcd

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Examples of the input (left col…
0
1
0

Fahim Farook

"Write and Paint: Generative Vision-Language Models are Unified Modal Learners. (arXiv:2206.07699v2 [cs.CV] UPDATED)" β€” A unified model based on training a model to write and paint concurrently.

Paper: http://arxiv.org/abs/2206.07699

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Illustration of the overall arc…
0
1
0

Fahim Farook

"Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance. (arXiv:2204.05543v2 [cs.CV] UPDATED)" β€” Reliable outpainting using depth-guidance.

Paper: http://arxiv.org/abs/2204.05543

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
A comparison of image outpainti…
0
1
0

Fahim Farook

"Text-driven Visual Synthesis with Latent Diffusion Prior. (arXiv:2302.08510v1 [cs.CV])" β€” Using diffusion models as the generic driver for diverse image generation tasks such as text-to3D, image editing, and StyleGAN adaptation.

Paper: http://arxiv.org/abs/2302.08510

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Applications of the proposed la…
0
2
0

i boosted this site earlier but i think it deserves more explanation. this is a search engine for images that shows you the copyright status for the image right there front and center and you can filter for them. for instance i grabbed this copyright 0 image of an echidna from there. and it even generated a citation for me: "Echidna on the move" by CazzJj is marked with CC0 1.0. i think this is a really easy way for getting a reference image while ensuring the person who took the image consents to its use! https://openverse.org/

1
4
1

Fahim Farook

"3D-aware Conditional Image Synthesis. (arXiv:2302.08509v1 [cs.CV])" β€” Using a 2 input such as a segmentation or edge map to generate photo-realistic images from different perspectives/viewpoints.

Paper: http://arxiv.org/abs/2302.08509
Code: https://github.com/dunbar12138/pix2pix3D

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Given a 2D label map as input, …
0
2
1

Fahim Farook

"T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models. (arXiv:2302.08453v1 [cs.CV])" β€” Controlling text-to-image diffusion models in a more granular fashion by using special adapters to provide extra guidance.

Paper: http://arxiv.org/abs/2302.08453
Code: https://github.com/TencentARC/T2I-Adapter

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
We propose T2I-Adapter, a simpl…
0
1
0

Fahim Farook

"MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation. (arXiv:2302.08113v1 [cs.CV])" β€” Controlling diffusion-based image generation so that you can specify image components, component placement etc. without any further fine-tuning.

Paper: http://arxiv.org/abs/2302.08113
Code: https://github.com/omerbt/MultiDiffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
MultiDiffusion enables flexible…
0
1
0

Fahim Farook

"\`A-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting. (arXiv:2302.07994v1 [cs.LG])" β€” Having multiple subsets of data trained on specific prompts and being able to compose the final model based on the prompts you select.

Paper: http://arxiv.org/abs/2302.07994

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
A-la-carte Learning and APT. Gi…
0
2
0

Fahim Farook

"PRedItOR: Text Guided Image Editing with Diffusion Prior. (arXiv:2302.07979v1 [cs.CV])" β€” Structure preserving, text guided image editing using diffusion models without needing a base prompt, fine-tuning of models etc.

Paper: http://arxiv.org/abs/2302.07979

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Examples of text guided image e…
0
2
1
@memetzgz If I wanted to read all of them, yes πŸ˜› But I just skim through the summaries to find the ones that look interesting … At the moment, I’ve found 5 interesting papers and have whittled down the 93 to 39 remaining papers …
0
0
1
Edited 2 years ago

I saw this little guy yesterday and I can't stop thinking about him. So round. So pink.

(It's an Australian pink robin and the photos are not mine.)

Credits:
1. Β© Deepak Karra http://instagram.com/ravi_arora/
2. Β© Ravi Arora http://instagram.com/ravi_arora/
3. Β© Ambika Angela Bone http://instagram.com/ambikangela/
4. Β© Tim J. Hopwood http://flickr.com/photos/timjhopwood/

42
3
0

Fahim Farook

A total of 93 papers in the cs.CV category on arXiv.org today β€” 61 new, 32 updated going in to the weekend …

So yesterday’s low paper count was definitely not a slowdown πŸ˜›

#AI #CV #NewPapers #DeepLearning #MachineLearning
1
0
0

Fahim Farook

Updated Akkoma on our server to the latest. New graphs, yay!! Hopefully, nothing broke though since I’m always scared of stuff breaking once I’ve updated since I’m always in a rush …
1
1
1

I ended up testing our new Neural Telephoto feature by shooting with iPhone SE for a while. I loved the shots I got out of it w/ native RAW and the extra reach of the virtual 2Γ—.

Death Valley / Owens Valley, California
iPhone SE, 1Γ— / 2Γ— (Neural Telephoto), @halide RAW

8
2
0

I was quietly minding my own business taking photos of Bullfinches (small/far away) in a tree when this Robin came and perched right in front of my camera and demanded to have his picture taken. How could I refuse?

🀣

1
3
0


La passiflore grande et belle fleur

2
1
0

The single-digit temperatures and a bit of sunshine created an ethereal ground fog this morning.

πŸ“† Feb 16, 2023
πŸ“· 1/1600 s at f/11, ISO 180, 170mm

0
3
1

The Adam optimizer is at the heart of modern AI. Researchers have been trying to dethrone Adam for years.

How about we ask a machine to do a better job? @googleai uses evolution to discover a simpler & efficient algorithm with remarkable features.

It’s just 8 lines of code: 🧡

0
2
0
Show older