Source Code

The sections below describe the core functionality of Scholaris, including helper functions for JSON schema generation, local file processing, external data retrieval from sources like NCBI, Semantic Scholar and OpenAlex, and an Assistant class for simplified chat and tool use. Follow this link to view the full source code.

Note

If you are reading this as part of the documentation pages, you can view a Jupyter notebook with the ‘literate’ source code and additional tests here. Scholaris was built to run with Llama 3.1 8B using the Ollama framework.

Helper functions

The Ollama framework supports tool calling (also referred to as function calling) with models such as Llama 3.1. To use it, the JSON schema of each function must be passed to the LLM as an argument. The LLM relies on this schema to infer which tool best fits a given prompt and which arguments to pass to that function. To simplify generating JSON schemas, use the helper and decorator functions defined below.
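As a rough illustration of what such a schema can look like, the sketch below derives one from a function's signature and docstring. It is not the Scholaris implementation of `generate_json_schema`: the names `sketch_json_schema` and `get_file_names_example` are hypothetical, the type mapping is deliberately incomplete, and the output layout assumes the common `{"type": "function", "function": {...}}` tool format.

```python
import inspect
from typing import get_type_hints

# Map Python annotations to JSON Schema type names (deliberately incomplete).
_JSON_TYPES = {
    str: "string",
    int: "integer",
    float: "number",
    bool: "boolean",
    list: "array",
    dict: "object",
}


def sketch_json_schema(func):
    """Build a tool schema from a function's signature and docstring.

    Hypothetical helper for illustration only.
    """
    hints = get_type_hints(func)
    properties = {}
    required = []
    for name, param in inspect.signature(func).parameters.items():
        # Fall back to "string" for unannotated or unmapped types.
        properties[name] = {"type": _JSON_TYPES.get(hints.get(name), "string")}
        # Parameters without a default value are treated as required.
        if param.default is inspect.Parameter.empty:
            required.append(name)
    return {
        "type": "function",
        "function": {
            "name": func.__name__,
            "description": inspect.getdoc(func) or "",
            "parameters": {
                "type": "object",
                "properties": properties,
                "required": required,
            },
        },
    }


def get_file_names_example(dir_path: str, ext: str = "pdf") -> str:
    """List files in a directory (hypothetical stand-in for a real tool)."""
    return f"{dir_path}/*.{ext}"


schema = sketch_json_schema(get_file_names_example)
print(schema["function"]["name"], schema["function"]["parameters"]["required"])
```

The function name and docstring are what the model sees when choosing a tool, so descriptive docstrings matter as much as accurate type annotations.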

Parameters of the Assistant class:

Parameter	Type	Default	Details
status	dict	{}	The status of the assistant
sys_message	str	None	The system message for the assistant; if not provided, a default message is used
model	str	llama3.1:latest	The model to use for the assistant
tools	Dict	{'get_file_names': <function get_file_names>, 'extract_text_from_pdf': <function extract_text_from_pdf>, 'get_titles_and_first_authors': <function get_titles_and_first_authors>, 'summarize_local_document': <function summarize_local_document>, 'describe_python_code': <function describe_python_code>, 'id_converter_tool': <function id_converter_tool>, 'query_openalex_api': <function query_openalex_api>, 'query_semantic_scholar_api': <function query_semantic_scholar_api>, 'respond_to_generic_queries': <function respond_to_generic_queries>}	The tools available to the assistant
add_tools	Dict	{}	Optional additional tools to register when initializing the assistant
authentication	Optional	None	Authentication credentials for API calls to external services
dir_path	str	../data	The directory path to which the assistant has access on the local computer
messages	List	[]	The conversation history
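The `tools` and `add_tools` parameters suggest a registry that maps tool names to callables. The sketch below shows one way such a registry could be merged and used for dispatch. It is an assumption-laden illustration, not the actual `Assistant` class: `AssistantSketch`, `call_tool`, `count_words` and `shout` are all hypothetical names, and only the parameter names and defaults are taken from the table above.

```python
from typing import Any, Callable, Dict, List, Optional


def count_words(text: str) -> int:
    """Hypothetical default tool: count whitespace-separated words."""
    return len(text.split())


def shout(text: str) -> str:
    """Hypothetical extra tool: upper-case the input."""
    return text.upper()


class AssistantSketch:
    """Illustrative stand-in that mirrors a few parameters from the table."""

    def __init__(
        self,
        model: str = "llama3.1:latest",
        tools: Optional[Dict[str, Callable[..., Any]]] = None,
        add_tools: Optional[Dict[str, Callable[..., Any]]] = None,
        dir_path: str = "../data",
    ) -> None:
        self.model = model
        self.dir_path = dir_path
        # Copy into a new dict so the caller's dictionaries are never mutated.
        self.tools = dict(tools if tools is not None else {"count_words": count_words})
        # Extra tools are merged on top of the defaults.
        self.tools.update(add_tools or {})
        self.messages: List[dict] = []

    def call_tool(self, name: str, **kwargs: Any) -> Any:
        """Look up a tool by name and call it with keyword arguments."""
        if name not in self.tools:
            raise KeyError(f"Unknown tool: {name}")
        return self.tools[name](**kwargs)


assistant = AssistantSketch(add_tools={"shout": shout})
print(sorted(assistant.tools))
print(assistant.call_tool("count_words", text="tool calling with ollama"))
```

Keying the registry by name fits tool calling, where the model returns the name of the chosen function together with its arguments.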

Helper functions

generate_json_schema

json_schema_decorator

Local file processing: listing, content extraction, and summarization

get_file_names

extract_text_from_pdf

extract_title_and_first_author

get_titles_and_first_authors

summarize_local_document

describe_python_code

External data retrieval from NCBI, OpenAlex and Semantic Scholar

id_converter_tool

detect_id_type

convert_id

query_openalex_api

query_semantic_scholar_api

respond_to_generic_queries

Assistant class

Assistant

Assistant.chat

Assistant.show_conversation_history

Assistant.clear_conversation_history

Assistant.pprint_tools

Assistant.get_status

add_to_class