Spaces:

Trickshotblaster
/

mike-chat

Sleeping

mike-chat / app.py

Minor update

766c9b2 over 1 year ago

1.95 kB

	import gradio as gr
	import gpt

	"""
	For information on how to customize the ChatInterface, peruse the gradio docs: https://www.gradio.app/docs/chatinterface
	"""
	demo = gr.Interface(fn=gpt.get_response, inputs=["textbox",
	gr.Slider(0, 100, value=50, step=1),
	gr.Slider(0.1, 2.0, value=1.0),
	gr.Dropdown(
	["mike-chat", "mike-code", "mike-code-600m"], value="mike-chat"),
	], outputs=gr.Markdown(line_breaks=True), title="Mike Chat", article="""
	Notice: if you have a GPU, I would highly recommend cloning the space and running it locally. The CPU provided by spaces isn't very fast.

	Mike is a small GPT-style language model. It was trained for about 8 hrs on my PC using fineweb-edu and open orca datasets. While it hallucinates a lot, it seems to be about on par with other LMs of its size (about 160M params). Model details:
	block_size: 512
	n_layers: 12
	n_heads: 12
	d_model: 768
	(Same as gpt-2 but without weight tying)

	Architecture for Mike-Code-600m:
	block_size: 256
	n_layers: 16
	n_heads: 12
	d_model: 1536""")


	if __name__ == "__main__":
	demo.launch()